Filamentous fungal mutants with improved homologous recombination efficiency

Field of the invention 
The invention relates to the field of molecular biology. It particularly relates to methods to 
improve the efficiency of directed integration of nucleic acids into the genome of a 
filamentous fungus and uses thereof. 

Background of the Invention 

Eukaryotic cells are preferred organisms for recombinant production of polypeptides and 
secondary metabolites. When constructing, for example, a protein production strain, the 
site of integration of the gene of interest coding for the protein to be produced is crucial 
as to the likelihood of transcription and/or expression of the integrated gene of interest. 
Since in most eukaryotic organisms integration of DNA into the genome occurs with high 
frequency at random, the construction of a protein production strain by recombinant DNA 
technology often leads to the unwanted integration of the expression cassette 
comprising the gene encoding the protein to be produced at random. This uncontrolled
"at random multiple integration" of an expression cassette is a potentially dangerous 
process, which can lead to unwanted modification of the genome of the host. It is 
therefore highly desirable to be able to construct a protein production strain by ensuring 
the targeting of the expression cassette with high efficiency. 

Furthermore, now that the sequences of complete genomes of an increasing number of
organisms are becoming available, this opens the opportunity to construct genome
spanning overexpression and deletion libraries. An important requirement for the efficient 
construction of such libraries is that the organism in question can be efficiently 
transformed and that the required homology needed to direct targeted integration of a 
nucleic acid into the genome is relatively short. 

Eukaryotic cells have at least two separate pathways (one via homologous and one via 
non-homologous recombination) through which nucleic acids (in particular of course 
DNA) can be integrated into the host genome. The yeast Saccharomyces cerevisiae is 
an organism with a preference for homologous recombination (HR). The ratio of
non-homologous to homologous recombination (NHR/HR) of this organism may vary from
about 0.07 to 0.007. 

WO 02/052026 discloses mutants of Saccharomyces cerevisiae having an improved
targeting efficiency of DNA sequences into its genome. Such mutant strains are deficient
in a gene involved in NHR (ku70). 

Contrary to Saccharomyces cerevisiae, most filamentous fungal cells have a preference 
for NHR. Among filamentous fungi, the NHR/HR ratio ranges from 1 to more
than 100. In such organisms, targeted integration frequency is rather low. To improve
this frequency, the length of homologous regions flanking a polynucleotide sequence to 
be integrated into the genome of such organisms has to be relatively long, for example at
least 2000 bp for disrupting a single gene and at least 500 bp for screening putative
transformants. The necessity of such flanking regions represents a heavy burden when
cloning the DNA construct comprising said polynucleotide and when transforming the
organism with it. Moreover, neighbouring genes which lie within those flanking regions 
can easily be disturbed during the recombination processes following transformation, 
thereby causing unwanted and unexpected side-effects. 

Mammalian cells deficient in ku70 have already been cloned (Pierce et al., Genes and
Development (2001), 15: 3237-3242). These mutants have a sixfold higher
homology-directed repair frequency, but no increase in the efficiency of homology-directed
targeted integration. This suggests that results obtained in organisms with a preference
for HR (Saccharomyces cerevisiae) cannot be extrapolated to organisms with a preference
for NHR.

Surprisingly, we found that steering the integration pathways of nucleic acids towards 
HR in filamentous fungi resulted in an improved efficiency for targeted integration of 
nucleic acids into the genome of filamentous fungi. 


Detailed description of the invention 

All patents and publications, including all sequences and methods disclosed within such
patents and publications, referred to herein are expressly incorporated by reference.
These patents and publications include: EP 357 127 B, EP 635 574 B, WO 97/06261,
WO 98/46772. 

The present invention provides a method for increasing the efficiency of targeted
integration of a polynucleotide to a pre-determined site into the genome of a filamentous
fungal cell with a preference for NHR, wherein said polynucleotide has a region of
homology with said pre-determined site, comprising steering an integration pathway
towards HR. The present invention arrives at such steering by elevating the
efficiency of the HR pathway, and/or by lowering (meaning reducing) the efficiency of the
NHR pathway, and/or by decreasing the NHR/HR ratio.

In the context of the invention, the HR pathway is defined as all genes and elements 
being involved in the control of the targeted integration of polynucleotides into the 
genome of a host, said polynucleotides having a certain homology with a certain
pre-determined site of the genome of a host wherein the integration is targeted. The NHR
pathway is defined as all genes and elements being involved in the control of the 
integration of polynucleotides into the genome of a host, irrespective of the degree of 
homology of the said polynucleotides with the genome sequence of the host. 

According to a preferred embodiment, the steering comprises providing a mutant of a
parent filamentous fungal cell, wherein the NHR/HR ratio is decreased in the mutant by
at least 5% as compared to said ratio in said parent organism as measured by the
following assay. More preferably, the NHR/HR ratio is decreased in the mutant by at least
10%, even more preferably at least 50% and most preferably at least 100% as compared
to said ratio in said parent organism. According to another preferred embodiment, the
filamentous fungal cell obtained has a ratio NHR/HR, which is at least 200, at least 50, at
least 10 as measured by the following assay. Preferably the ratio of the filamentous
fungal cell is at least 1, more preferably at least 0.5, even more preferably at least 0.1,
even more preferably at least 0.05, even more preferably at least 0.01, even more
preferably at least 0.005, even more preferably at least 0.001, even more preferably at
least 0.0005, even more preferably at least 0.0001 and most preferably at least 0.00001.
The ratio of NHR/HR can be measured by the assay as described in WO 02/052026
(table 2, p23). According to a preferred embodiment, the parent organism is the
filamentous fungus cell Aspergillus niger CBS 513.88, Aspergillus oryzae ATCC 20423,
IFO 4177, ATCC 1011, ATCC 9576, ATCC 14488-14491, ATCC 11601, ATCC 12892,
Penicillium chrysogenum CBS 455.95, Penicillium citrinum ATCC 38065, Penicillium
chrysogenum P2, Acremonium chrysogenum ATCC 36225 or ATCC 48272, Trichoderma reesei
ATCC 26921 or ATCC 56765 or ATCC 26921, Aspergillus sojae ATCC 11906,
Chrysosporium lucknowense ATCC 44006 and derivatives thereof.
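
By way of illustration only, the comparison of the NHR/HR ratio between a parent and a mutant can be sketched as follows. This is a minimal sketch that assumes the ratio is formed from counts of transformants scored as randomly integrated versus targeted; it does not reproduce the assay of WO 02/052026. The function names and the colony counts are hypothetical.

```python
def nhr_hr_ratio(random_integrants: int, targeted_integrants: int) -> float:
    """Ratio of non-homologous (random) to homologous (targeted) integration events."""
    if targeted_integrants <= 0:
        raise ValueError("at least one targeted integrant is needed to form the ratio")
    return random_integrants / targeted_integrants


def percent_decrease(parent_ratio: float, mutant_ratio: float) -> float:
    """Decrease of the mutant NHR/HR ratio relative to the parent ratio, in percent."""
    if parent_ratio <= 0:
        raise ValueError("parent ratio must be positive")
    return 100.0 * (parent_ratio - mutant_ratio) / parent_ratio


# Hypothetical counts, chosen only to illustrate the arithmetic.
parent = nhr_hr_ratio(random_integrants=95, targeted_integrants=5)
mutant = nhr_hr_ratio(random_integrants=20, targeted_integrants=80)
print(f"parent NHR/HR = {parent:.2f}, mutant NHR/HR = {mutant:.2f}")
print(f"decrease = {percent_decrease(parent, mutant):.1f}%")
```

With these illustrative counts the mutant would satisfy the "at least 50%" decrease but not the "at least 100%" decrease, which would require a ratio of zero.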

Alternatively, the NHR/HR ratio in a filamentous fungus can be monitored using
techniques known to the skilled person such as transcriptional profiling and/or northern
blotting and/or western blotting of at least one of the following components involved in
such pathways: ku70, ku80, mre11, rad50, rad51, rad52, xrs2, sir4, lig4. Preferably, the
expression level of homologous components of the Ku complex is monitored. More
preferably, the expression level of homologues of ku70 and/or ku80 is monitored.

In the context of this invention, "a region of homology" means "at least one" region of
homology. A pre-determined site is herein defined as a site within the genetic material
contained by a host cell to which a polynucleotide with homology to this same site is 
integrated with a method according to the invention. 

In a preferred embodiment, the invention provides a method for increasing the efficiency
of targeted integration of a polynucleotide to a pre-determined site, whereby said
polynucleotide has homology at or around the said pre-determined site, in a filamentous
fungus with a preference for NHR, comprising steering an integration pathway towards
HR by providing a filamentous fungus, wherein the efficiency of the NHR pathway has
been lowered and/or the NHR/HR ratio has been decreased compared to the efficiency
of the NHR pathway and/or the NHR/HR ratio of the filamentous fungus it originates from
under the same conditions. According to a preferred embodiment, the parent organism is
the filamentous fungus Aspergillus niger CBS 513.88, Aspergillus oryzae ATCC 20423,
IFO 4177, ATCC 1011, ATCC 9576, ATCC 14488-14491, ATCC 11601, ATCC 12892,
Penicillium chrysogenum CBS 455.95, Penicillium citrinum ATCC 38065, Penicillium
chrysogenum P2, Acremonium chrysogenum ATCC 36225 or ATCC 48272, Trichoderma
reesei ATCC 26921 or ATCC 56765 or ATCC 26921, Aspergillus sojae ATCC 11906,
Chrysosporium lucknowense ATCC 44006 or derivatives thereof.

The efficiency of the NHR pathway can be measured in the assay as described
in WO 02/052026. Alternatively, the efficiency of the NHR pathway in a filamentous
fungus can be monitored using techniques known to the skilled person such as
transcriptional profiling and/or northern blotting and/or western blotting of components
involved in such pathway. Preferably, the expression level of at least one of the following
components is monitored: ku70, ku80, mre11, rad50, rad51, rad52, xrs2, sir4, lig4.
Preferably, the expression level of homologous components of the Ku complex is
monitored. More preferably, the expression level of homologues of ku70 and/or ku80 is
monitored.



A lowered NHR efficiency means at least lower than in the parental cell the obtained cell
originates from. Preferably, lowered means two times lower, more preferably ten times lower,
even more preferably 100 times lower, most preferably more than 1000 times lower and
even most preferably not detectable using northern or western blotting, array techniques
or a phenotypic screen.

A typical phenotypic screen that could be used comprises the following steps:
transforming the putative NHR mutants with an expression cassette comprising a
selection marker gene flanked by homologous sequences of a predetermined genomic
site. The selection marker gene used in this phenotypic screen can be selected from a
number of marker genes that are useful for transformation of filamentous fungi. By way
of example these markers include but are not limited to dominant and bi-directional
selection marker genes such as an acetamidase (amdS) gene (EP 635 574 B or WO
97/06261), auxotrophic marker genes such as argB, trpC or pyrG and antibiotic
resistance genes providing resistance against e.g. phleomycin (the product encoded by
the ble gene confers resistance to phleomycin), hygromycin B or G418. A preferred
selection marker gene is the ble gene encoding a protein conferring resistance to
phleomycin. The putative NHR mutants already contain at this predetermined genomic
site a bi-directional selection marker gene such as an amdS gene, nitrate reductase gene
(niaD), sulphate permease (sutB) gene or pyrG gene. The niaD gene has already been
described elsewhere (Gouka RJ, van Hartingsveldt W, Bovenberg RA, van den Hondel
CA, van Gorcom RF. Cloning of the nitrate-nitrite reductase gene cluster of Penicillium
chrysogenum and use of the niaD gene as a homologous selection marker. J Biotechnol.
1991 Sep;20(2):189-99). The niaD gene enables direct selection of transformants on
plates containing chlorate, as cells become resistant to chlorate. The sutB gene has
already been described elsewhere (van de Kamp M, Pizzinini E, Vos A, van der Lende
TR, Schuurs TA, Newbert RW, Turner G, Konings WN, Driessen AJ. Sulfate transport in
Penicillium chrysogenum: cloning and characterization of the sutA and sutB genes. J.
Bacteriol. 1999 Dec;181(23):7228-34). A preferred selection marker gene is the
A. nidulans amdS coding sequence fused to the A. nidulans gpdA promoter (EP 635 574
B). AmdS genes from other filamentous fungi may also be used (WO 97/06261). In the
preferred form of the phenotypic screen (ble and amdS genes used as selection marker
genes), in non-HR-improved mutants the ble-cassette will integrate randomly in the
genome, enabling many transformants to grow on a double selective medium with
acetamide and phleomycin. In mutants with improved HR there will be a limited number
of transformants on the double selective plates as the amdS-cassette is exchanged with
the ble-cassette. In this case more mutants will appear on a fluoracetamide-phleomycin
double selective plate.
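
The read-out of such a screen may, purely as an illustration, be summarised as follows. The sketch assumes that equal amounts of the same transformation mixture are plated on acetamide-phleomycin plates (growth indicates that the amdS-cassette is retained, i.e. random integration of the ble-cassette) and on fluoracetamide-phleomycin plates (growth indicates that the amdS-cassette has been exchanged, i.e. targeted integration). The function names and colony counts are hypothetical, and comparing against the parent strain is one possible decision rule, not a rule taken from the description.

```python
def targeted_fraction(acetamide_phleo: int, fluoracetamide_phleo: int) -> float:
    """Fraction of phleomycin-resistant transformants in which amdS was exchanged."""
    total = acetamide_phleo + fluoracetamide_phleo
    if total == 0:
        raise ValueError("no transformants counted")
    return fluoracetamide_phleo / total


def looks_hr_improved(mutant_counts: tuple, parent_counts: tuple) -> bool:
    """True when the mutant shows a higher targeted fraction than its parent.

    Each tuple is (colonies on acetamide-phleomycin,
                   colonies on fluoracetamide-phleomycin).
    """
    return targeted_fraction(*mutant_counts) > targeted_fraction(*parent_counts)


# Hypothetical colony counts for a parent strain and a candidate mutant.
parent_counts = (90, 10)
mutant_counts = (15, 85)
print(targeted_fraction(*parent_counts), targeted_fraction(*mutant_counts))
print(looks_hr_improved(mutant_counts, parent_counts))
```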

According to another preferred embodiment, the filamentous fungus having a lowered
NHR efficiency and/or a decreased NHR/HR ratio is a filamentous fungus wherein a
component involved in NHR has been inhibited. In this context, "a" means "at least one":
at least one component involved in NHR has been inhibited in a given filamentous
fungus. Inhibition can be achieved by down regulating the expression level of a gene
involved in NHR or inactivating a gene encoding a component involved in NHR and/or by
down regulating the expression level of a component involved in NHR, and/or
(temporarily) decreasing the (protein) activity of a component involved in NHR, or by a
combination of these possibilities.

Preferably, the filamentous fungus obtained has the expression of a gene involved in
NHR down regulated by comparison to the expression of said gene in the parent
filamentous fungal cell it originates from under the same conditions. According to a
preferred embodiment, the parent filamentous fungus is the filamentous fungus
Aspergillus niger CBS 513.88, Aspergillus oryzae ATCC 20423, IFO 4177, ATCC 1011,
ATCC 9576, ATCC 14488-14491, ATCC 11601, ATCC 12892, Penicillium chrysogenum
CBS 455.95 or Penicillium citrinum ATCC 38065, Penicillium chrysogenum P2,
Acremonium chrysogenum ATCC 36225 or ATCC 48272, Trichoderma reesei ATCC 26921
or ATCC 56765 or ATCC 26921, Aspergillus sojae ATCC 11906, Chrysosporium
lucknowense ATCC 44006 or derivatives thereof.

The expression level of a gene or a DNA sequence is down regulated when the
expression level of this specific gene or DNA sequence in the obtained filamentous
fungus is lower than the expression level of the same gene or DNA sequence in the
parental filamentous fungus it originates from, preferably three times lower, more
preferably four times lower, most preferably more than four times lower and even most
preferably not detectable using northern or western blotting or array techniques.
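
The graded preferences above can be expressed, as a non-limiting sketch, in terms of the fold reduction of a quantified expression signal. The sketch assumes that "three times lower" is read as a parent-to-mutant signal ratio of at least 3, and that a non-positive mutant signal stands for "not detectable"; both readings, as well as the function names and signal values, are assumptions made for illustration.

```python
import math


def fold_reduction(parent_level: float, mutant_level: float) -> float:
    """Parent-to-mutant expression ratio; infinity when the mutant signal is not detectable."""
    if parent_level <= 0:
        raise ValueError("parent expression level must be positive")
    if mutant_level <= 0:
        return math.inf
    return parent_level / mutant_level


def preference_tier(fold: float) -> str:
    """Map a fold reduction onto the preference tiers named in the description."""
    if math.isinf(fold):
        return "even most preferred (not detectable)"
    if fold > 4:
        return "most preferred (more than four times lower)"
    if fold >= 4:
        return "more preferred (four times lower)"
    if fold >= 3:
        return "preferred (three times lower)"
    if fold > 1:
        return "down regulated"
    return "not down regulated"


# Hypothetical band intensities (arbitrary units) from a northern blot.
for mutant_signal in (100.0, 80.0, 30.0, 25.0, 10.0, 0.0):
    fold = fold_reduction(100.0, mutant_signal)
    print(mutant_signal, preference_tier(fold))
```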

The down and/or up regulation of the expression level of a DNA sequence can be 
monitored by quantifying the amount of corresponding mRNA present in a cell by 
northern blotting (in Molecular Cloning: A Laboratory Manual, Sambrook et al., New
York: Cold Spring Harbour Press, 1989) for example and/or by quantifying the amount of
corresponding protein present in a cell by western blotting for example. The difference
in mRNA amount may also be quantified by DNA array analysis (Eisen, M.B. and Brown,
P.O. DNA arrays for analysis of gene expression. Methods Enzymol. 1999;303:179-205).

The down regulation of the expression level of at least one gene or DNA sequence may
be obtained by genetic manipulation by one of the following techniques or by a
combination thereof:
a. using recombinant genetic manipulation techniques,
b. submitting the filamentous fungus to mutagenesis.
Alternatively or in combination with above-mentioned techniques and according to
another preferred embodiment, the down regulation of the expression level of at least
one gene or DNA sequence may be obtained by submitting the filamentous fungus to an
inhibiting compound/composition.

The filamentous fungus obtained may be subsequently selected by monitoring the 
expression level of said gene or DNA sequence. Optionally, the filamentous fungus is 
subsequently selected by measuring its efficiency of the NHR and/or of the HR pathways 
and/or its NHR/HR ratio. In the context of the invention, the efficiency of the HR pathway
of a filamentous fungus may be measured by the efficiency of the targeted integration of
a given polynucleotide sequence into a pre-determined site in the genome of the
filamentous fungus using given homology region(s). In the context of the invention, the
efficiency of the NHR pathway of a filamentous fungus may be measured by the
efficiency of the non-targeted integration of a given polynucleotide sequence in the
genome of the filamentous fungus irrespective of any homology region(s).

More preferably, the down regulation of the expression of at least one DNA sequence is
made with recombinant genetic manipulation techniques such as defined in step a. to
obtain a recombinant filamentous fungus. Most preferably step a. comprises deleting the
DNA sequence, even most preferably the deleted DNA sequence is replaced by a
non-functional variant thereof, and even most preferably the deletion and replacement are
made by gene replacement, preferably as described in EP 357 127 B.

In cases of deletion or replacement of at least one DNA sequence of the chosen
filamentous fungus, an appropriate DNA sequence has to be introduced at the target
locus. The target locus is in this case the DNA sequence involved in the NHR pathway to be
deleted or replaced. The appropriate DNA sequence is preferably present on a cloning
vector. Suitable cloning vectors are the ones that are able to integrate at the
pre-determined target locus in the chromosomes of the filamentous fungal host cell used.
A preferred integrative cloning vector comprises a DNA fragment, which is homologous to
the DNA sequence to be deleted or replaced, for targeting the integration of the cloning
vector to this pre-determined locus. In order to promote targeted integration, the cloning
vector is preferably linearized prior to transformation of the host cell. Preferably,
linearization is performed such that at least one but preferably either end of the cloning
vector is flanked by sequences homologous to the DNA sequence to be deleted or
replaced.

The length of the homologous sequences flanking the DNA sequence to be deleted or
replaced is preferably less than 2 kb, even preferably less than 1 kb, even more
preferably less than 0.5 kb, even more preferably less than 0.2 kb, even more preferably
less than 0.1 kb, even more preferably less than 50 bp and most preferably less than
30 bp.

The selection marker gene in the cloning vector can be selected from a number of
marker genes that are useful for transformation of filamentous fungi. By way of example
these markers include but are not limited to dominant and bi-directional selection marker
genes such as an acetamidase (amdS) gene (EP 635 574 B or WO 97/06261),
auxotrophic marker genes such as argB, trpC or pyrG and antibiotic resistance genes
providing resistance against e.g. phleomycin, hygromycin B or G418. A preferred
selection marker gene is the A. nidulans amdS coding sequence fused to the A. nidulans
gpdA promoter (EP 635 574 B). AmdS genes from other filamentous fungi may also be
used (WO 97/06261). The amdS selection marker gene has the advantage that it can be
used several times in the same strain to replace and/or delete distinct DNA sequences.
By means of counterselection on fluoracetamide media as described in EP 635 574 B,
the resulting strain is marker free and can be used for further gene modifications.

A preferred strategy for down regulating the expression of a given DNA sequence
comprises the deletion of the wild type DNA sequence and/or replacement by a modified
DNA sequence, whose expression product is not functional. The deletion and the
replacement are preferably performed by the gene replacement technique described in
EP 0 357 127 B1. The specific deletion of a gene is preferably performed using the amdS
gene as selection marker gene as described in EP 635 574 B.

Alternatively or in combination with other mentioned techniques, a technique based on in
vivo recombination of cosmids in E. coli can be used, as described in: A rapid method for
efficient gene replacement in the filamentous fungus Aspergillus nidulans (2000)
Chaveroche, M-K., Ghigo, J-M. and d'Enfert, C.; Nucleic Acids Research, vol 28, no 22.
This technique is applicable to other filamentous fungi, for example A. niger.

Down regulating the expression of a DNA sequence may also be achieved by using
anti-sense nucleic acids, or by UV or chemical mutagenesis (Mattern, I.E., van Noort J.M.,
van den Berg, P., Archer, D. B., Roberts, I.N. and van den Hondel, C. A., Isolation and
characterization of mutants of Aspergillus niger deficient in extracellular proteases. Mol
Gen Genet. 1992 Aug;234(2):332-6.).

Preferably, the deficiency brought in the NHR pathway is an inducible one. This can be
reached by replacing the endogenous regulatory regions of the gene encoding the
component involved in NHR by new regulatory regions, preferably by using a repressible
or regulatable promoter, more preferably by using a promoter that can be switched on/off
by glucose repression, or ammonia repression, or pH repression. Examples of
glucose-repressed promoters are the Penicillium chrysogenum pcbAB promoter (Martin JF,
Casqueiro J, Kosalkova K, Marcos AT, Gutierrez S. Penicillin and cephalosporin
biosynthesis: mechanism of carbon catabolite regulation of penicillin production. Antonie
Van Leeuwenhoek. 1999 Jan-Feb;75(1-2):21-31. Review.), or the Aspergillus niger
glucoamylase promoter. Examples of on/off switchable promoters are described in the
following publications:
- An activator/repressor dual system allows tight tetracycline-regulated gene expression
in budding yeast: Belli et al, (1998) Nucl. Acid Research, vol 26, n.4:942-947,

- A light-switchable gene promoter system: Shimizu-Sato et al, (2002) Nat. Biotech. Vol
20, no 10:1041-1044.


According to a preferred embodiment, the filamentous fungus is deficient in at least one
of its endogenous genes, which are homologous with the following yeast genes involved
in the NHR pathway: KU70, KU80, RAD50, MRE11, XRS2 and SIR4 (van den Bosch et
al (2002): DNA double-strand break repair by homologous recombination. Biol. Chem.
Vol. 383:873-892 and Allen et al, (2003): Interactive competition between homologous
recombination and non-homologous end joining. Mol. Cancer Res., vol 1:913-920).

All kinds of mutants having at least one component involved in NHR, which is no longer
capable or at least significantly less capable to perform its function in the process of
NHR, are mutants contemplated by the present invention. Preferably, the component
involved in NHR has been inhibited so that the efficiency of the NHR pathway in the
obtained mutant is less than 90% of the activity in the parent cell it originates from under
the same conditions as measured in the assay defined earlier, even preferably less than
85%, more preferably less than 80%, even more preferably less than 70%, most
preferably less than 50%.

According to a preferred embodiment, the parent filamentous fungus is the filamentous
fungus Aspergillus niger CBS 513.88, Aspergillus oryzae ATCC 20423, IFO 4177, ATCC
1011, ATCC 9576, ATCC 14488-14491, ATCC 11601, ATCC 12892, Penicillium
chrysogenum CBS 455.95 or Penicillium citrinum ATCC 38065, Penicillium
chrysogenum P2, Acremonium chrysogenum ATCC 36225 or ATCC 48272,
Trichoderma reesei ATCC 26921 or ATCC 56765 or ATCC 26921, Aspergillus sojae
ATCC 11906, Chrysosporium lucknowense ATCC 44006 or derivatives thereof.

Preferably, the filamentous fungus cell is deficient in at least one of the following genes:
hdfA as identified in SEQ ID NO 2 or homologues thereof, hdfB as identified in SEQ ID
NO 5 or homologues thereof, or both, and/or has, preferably inducibly, a decreased
amount of at least one of the proteins encoded by these genes.

According to another preferred embodiment, the down regulation of the expression level
of at least one gene or DNA sequence may be obtained by genetic modification by
submitting the filamentous fungus to mutagenesis. Filamentous fungal cells may be
subjected to random mutagenesis and subsequently to a selection assay to isolate the
mutants with improved HR from the whole diverse population of mutants.

According to a preferred embodiment of the present invention, the Aspergillus niger
strain CBS 513.88, Aspergillus oryzae ATCC 20423, IFO 4177, ATCC 1011, ATCC
9576, ATCC 14488-14491, ATCC 11601, ATCC 12892, or the Penicillium chrysogenum
strain CBS 455.95 or the Trichoderma reesei ATCC 26921 or ATCC 56765 or ATCC
26921, Aspergillus sojae ATCC 11906, Chrysosporium lucknowense ATCC 44006 are
used as starting strain to perform the mutagenesis.

The starting strain is subjected to UV irradiation so that the survival percentage ranges
between 0.001% and 60%. Preferably, the survival percentage ranges between 0.01%
and 50%. It is well known to the skilled person that conidiospores are the preferred
material to mutagenize filamentous fungi by physical or chemical means. Mutants may
however also be obtained from mycelium cells. The selection method described herein
may be applied to select mutants obtained from either conidiospores or mycelium cells.
Preferably the mutagenesis is applied to conidiospores. UV irradiation is preferably
applied for different times such as 7.5, 15 and 30 minutes to obtain mild, medium and
strong mutation rate levels in the cells. The mutated samples may either be directly
re-sporulated or incubated for an extended recovery period in a rich medium such as YNB
or YEPD (see definition in example 9) before sporulation is induced (for example as
described in example 9).
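
As an illustration of how the survival percentage may be checked against the ranges given above, the following minimal sketch assumes that survival is determined from colony-forming-unit (CFU) counts of the spore suspension before and after irradiation. The function names and the counts are hypothetical.

```python
def survival_percentage(cfu_before: float, cfu_after: float) -> float:
    """Percentage of viable spores remaining after UV irradiation."""
    if cfu_before <= 0:
        raise ValueError("CFU count before irradiation must be positive")
    return 100.0 * cfu_after / cfu_before


def dose_assessment(survival: float) -> str:
    """Compare a survival percentage with the ranges given in the description."""
    if 0.01 <= survival <= 50.0:
        return "within preferred range (0.01% to 50%)"
    if 0.001 <= survival <= 60.0:
        return "within range (0.001% to 60%)"
    return "outside range"


# Hypothetical CFU counts per ml for three irradiation times (minutes).
untreated = 1_000_000
for minutes, survivors in ((7.5, 550_000), (15, 50_000), (30, 5)):
    s = survival_percentage(untreated, survivors)
    print(f"{minutes} min: {s:g}% survival, {dose_assessment(s)}")
```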

The sporulated batches may then be tested for their efficiency in gene targeting. This
could be tested by the following method. Protoplasts may be transformed with at least
one, preferably two or more DNA fragments carrying expression cassettes of functional
selection markers. The selection marker genes in the expression cassettes can be
selected from a number of marker genes that are useful for transformation of filamentous
fungi. By way of example these markers include but are not limited to dominant and
bi-directional selection marker genes such as an acetamidase (amdS) gene (EP 635 574 B
or WO 97/06261), auxotrophic marker genes such as argB, trpC or pyrG and antibiotic
resistance genes providing resistance against e.g. phleomycin, hygromycin B or G418. A
preferred selection marker gene is the A. nidulans amdS coding sequence fused to the
A. nidulans gpdA promoter (EP 635 574 B). AmdS genes from other filamentous fungi
may also be used (WO 97/06261). Preferably the selection markers are the ble and
amdS genes. The gene ble encodes a protein capable of conferring resistance to
phleomycin. The gene amdS encodes a protein enabling cells to grow on acetamide as
the sole nitrogen source (as described in EP 635 574 B). Techniques applied for the
transfer of DNA to protoplasts of filamentous fungi are well known in the art and are
described in many references, including Finkelstein and Ball (eds.), Biotechnology of
filamentous fungi, technology and products, Butterworth-Heinemann (1992); Bennett and
Lasure (eds.) More Gene Manipulations in fungi, Academic Press (1991); Turner, in:
Pühler (ed), Biotechnology, second completely revised edition, VCH (1992). The
Ca-PEG mediated protoplast transformation is used as described in EP 635 574 B.

To select targeted integration of these two expression cassettes to two distinct specific
loci in the filamentous fungi genome, short homologous stretches of DNA may be added,
for example via PCR, on both sides of the DNA fragments. Several types of construct
could be made to improve the chances to select a mutant having an improved targeting
efficiency: the homologous stretches of DNA could typically vary from 30 bp to 1000 bp,
preferably 30 to 100 bp, more preferably 30 bp. In theory all loci in the filamentous fungi
genome could be chosen for targeting integration of the expression cassettes.
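
The addition of short homologous stretches via PCR can be illustrated by the following minimal sketch of primer construction. It assumes that each primer consists of a 5' tail of 30 bp taken from the target locus and a 3' part of 20 nt annealing to the end of the expression cassette; the 20 nt annealing length, the function names and the toy sequences are assumptions made for illustration and are not taken from the description.

```python
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def targeting_primers(upstream_flank: str, downstream_flank: str, cassette: str,
                      homology_len: int = 30, anneal_len: int = 20) -> tuple:
    """Build forward and reverse primers carrying locus-homologous 5' tails.

    The PCR product then reads: upstream homology + cassette + downstream homology.
    """
    if len(upstream_flank) < homology_len or len(downstream_flank) < homology_len:
        raise ValueError("flanking sequences are shorter than the requested homology")
    if len(cassette) < anneal_len:
        raise ValueError("cassette is shorter than the annealing region")
    forward = upstream_flank[-homology_len:] + cassette[:anneal_len]
    reverse = reverse_complement(cassette[-anneal_len:] + downstream_flank[:homology_len])
    return forward, reverse


# Toy sequences standing in for the genomic flanks and a marker cassette.
upstream = "GATTACAGGC" * 4
downstream = "CCTGAATCGT" * 4
cassette = "ATGGCCAAGTTGACCAGTGC" + "N" * 50 + "TGCTCGAGTAAGGATCCTAA"
fwd, rev = targeting_primers(upstream, downstream, cassette)
print(fwd)
print(rev)
```

Longer homologous stretches within the 30 bp to 1000 bp range would in practice be added by other means than a single primer tail; the sketch only covers the short-tail case.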

Preferably, the locus wherein targeting will take place is such that when the wild type
gene present at this locus has been replaced by the gene comprised in the expression
cassette, the obtained mutant will display a change detectable by a given assay.
Preferably the locus is the niaD locus, thereby disrupting the nitrate reductase gene
(Gouka RJ, van Hartingsveldt W, Bovenberg RA, van den Hondel CA, van Gorcom RF.
Cloning of the nitrate-nitrite reductase gene cluster of Penicillium chrysogenum and use
of the niaD gene as a homologous selection marker. J Biotechnol. 1991 Sep;20(2):189-99),
enabling direct selection of transformants on plates containing chlorate, as cells
become resistant to chlorate. Another preferred locus is the sutB locus, thereby
disrupting the sulphate permease gene (van de Kamp M, Pizzinini E, Vos A, van der
Lende TR, Schuurs TA, Newbert RW, Turner G, Konings WN, Driessen AJ. Sulfate
transport in Penicillium chrysogenum: cloning and characterization of the sutA and sutB
genes. J. Bacteriol. 1999 Dec;181(23):7228-34), enabling direct selection of
transformants on plates containing selenate. Mutants with both selection markers
present and having the two alterations resulting from the inactivation of the genes
present at the integration loci are strains with improved targeted integration.

According to another preferred embodiment, the mutant filamentous fungus having a
lowered efficiency in the NHR pathway, or a decreased NHR/HR ratio and/or an elevated
efficiency of the HR pathway is obtained by decreasing, more preferably partially or most
preferably completely inhibiting a component involved in NHR.

Partial or complete inhibition of a component involved in NHR can be obtained by
different methods, for example by an antibody directed against such a component or a
chemical inhibitor or a protein inhibitor or a physical inhibitor (Tour O. et al, (2003) Nat.
Biotech: Genetically targeted chromophore-assisted light inactivation. Vol. 21, no.
12:1505-1508) or peptide inhibitor or an anti-sense molecule or RNAi molecule (R.S.
Kamath et al, (2003) Nature: Systematic functional analysis of the Caenorhabditis
elegans genome using RNAi. Vol. 421, 231-237). Irrespective of the kind of (partial or
more preferably complete) inhibition it is important that a component involved in NHR is
no longer capable or at least significantly less capable to perform its function in the
process of NHR as defined above.

Components involved in NHR comprise filamentous fungal homologues of yeast ku70,
rad50, mre11, xrs2, lig4, sir4, ku80, lif1 or nej1 or associating components. Because the
nomenclature of genes differs between organisms, a functional equivalent and/or a
functional fragment thereof, all defined herein as being capable of performing
(in function, not in amount) at least one function of the yeast genes ku70, rad50, mre11,
xrs2, lig4, sir4, ku80, lif1 or nej1, are also included in the present invention. By transiently
(partially or more preferably completely) inhibiting a component involved in NHR, a
nucleic acid is integrated at any desired position without permanently modifying a
component involved in NHR, thereby preventing unwanted side effects caused by the
permanent presence of such a modified component involved in NHR.
In addition to the above-mentioned techniques or as an alternative, it is also possible to
obtain a lowered NHR efficiency by inhibiting the activity of proteins which are involved
in NHR, or to re-localize the proteins involved in NHR by means of alternative signal
sequences (Ramon de Lucas, J., Martinez, O., Perez, P., Isabel Lopez, M., Valenciano, S.
and Laborda, F. The Aspergillus nidulans carnitine carrier encoded by the acuH gene is
exclusively located in the mitochondria. FEMS Microbiol Lett. 2001 Jul 24;201(2):193-8.)
or retention signals (Derkx, P. M. and Madrid, S. M. The foldase CYPB is a component of
the secretory pathway of Aspergillus niger and contains the endoplasmic reticulum
retention signal HEEL. Mol. Genet. Genomics. 2001 Dec;266(4):537-45.).

Alternatively or in combination with the above-mentioned techniques, inhibition of protein
activity can also be obtained by UV or chemical mutagenesis (Mattern, I.E., van Noort
J.M., van den Berg, P., Archer, D. B., Roberts, I.N. and van den Hondel, C. A., Isolation
and characterization of mutants of Aspergillus niger deficient in extracellular proteases.
Mol Gen Genet. 1992 Aug;234(2):332-6.) or by the use of inhibitors like the proteasomal
inhibitor of Affinity (clasto-lactacystin-β-lactone, Affinity Research Products Ltd.,
CW8405-Z02185).

According to another preferred embodiment, the steering towards HR comprises adding 
an excess of small double stranded polynucleotides to the polynucleotide to be
integrated (Agrawal N. et al.: RNA Interference: biology, mechanism and applications.
Microbiol. Mol. Biol. Rev., vol. 67, no. 4:657-685). 

In a preferred embodiment the invention provides a method for increasing the efficiency
of targeted integration of a polynucleotide to a pre-determined site, whereby said
polynucleotide has homology at or around the said pre-determined site, in a filamentous
fungus with a preference for NHR, comprising steering an integration pathway towards
HR by providing a filamentous fungus wherein the efficiency of the HR pathway has
been elevated compared to that of the parent filamentous fungus it originates from
under the same conditions. According to a preferred embodiment, the parent organism is
the filamentous fungus cell Aspergillus niger CBS 513.88, Aspergillus oryzae ATCC
20423, IFO 4177, ATCC 1011, ATCC 9576, ATCC 14488-14491, ATCC 11601,
ATCC12892, Penicillium chrysogenum CBS 455.95, or Penicillium citrinum ATCC
38065, Penicillium chrysogenum P2, Acremonium chrysogenum ATCC 36225 or ATCC
48272, Trichoderma reesei ATCC 26921 or ATCC 56765 or ATCC 26921, Aspergillus
sojae ATCC11906, Chrysosporium lucknowense ATCC44006 or derivatives thereof.
Elevated means at least higher than in the parental cell the obtained cell originates from.
Preferably, elevated means twice higher, more preferably three times higher, even more
preferably four times higher, most preferably more than four times higher, as determined
using northern or western blotting or array technique or a phenotypic screen.
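
By way of illustration only, the tiers of "elevated" defined above can be expressed as a short Python sketch. The function name `elevation_tier`, the tier labels and the example values are hypothetical and are not part of the description; the sketch assumes that the mutant and the parent were measured with the same technique under the same conditions.

```python
def elevation_tier(mutant_level: float, parent_level: float) -> str:
    """Classify a mutant/parent ratio into the tiers named in the text.

    Both levels are assumed to be positive measurements obtained with the
    same technique (e.g. blot or array signal); names are hypothetical.
    """
    if parent_level <= 0:
        raise ValueError("parent_level must be positive")
    fold = mutant_level / parent_level
    if fold > 4:
        return "most preferred"        # more than four times higher
    if fold >= 4:
        return "even more preferred"   # four times higher
    if fold >= 3:
        return "more preferred"        # three times higher
    if fold >= 2:
        return "preferred"             # twice higher
    if fold > 1:
        return "elevated"              # at least higher than the parent
    return "not elevated"


# Hypothetical example values, not measured data.
print(elevation_tier(mutant_level=3.5, parent_level=1.0))
```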

According to another preferred embodiment, the filamentous fungus has the expression 
level of at least one gene involved in HR, which has been up regulated by comparison to 
the expression level of the same gene in the filamentous fungal cell it originates from. 
This can be achieved by increasing the expression level of a gene encoding a 
component involved in HR and/or by increasing the expression level of a component 
involved in HR and/or by (temporarily) increasing the activity of the component involved 
in HR. 

Preferably, the filamentous fungus obtained has the expression of a gene involved in 
HR, which has been up regulated by comparison to the expression of said gene in the 
filamentous fungal cell it originates from. According to a preferred embodiment, the 
parent organism is the filamentous fungus cell Aspergillus niger CBS 513.88, Aspergillus 
oryzae ATCC 20423, IFO 4177, ATCC 1011, ATCC 9576, ATCC 14488-14491, ATCC
11601, ATCC12892, Penicillium chrysogenum CBS 455.95, or Penicillium citrinum
ATCC 38065, Penicillium chrysogenum P2, Acremonium chrysogenum ATCC 36225 or
ATCC 48272, Trichoderma reesei ATCC 26921 or ATCC 56765 or ATCC 26921,
Aspergillus sojae ATCC11906, Chrysosporium lucknowense ATCC44006 or derivatives 
thereof. 

The expression level of a DNA sequence is up regulated when the expression level of 
this specific DNA sequence in the obtained filamentous fungus is higher than the 
expression level of the same DNA sequence in the parental filamentous fungus it 
originates from, preferably three times higher, more preferably four times higher, most 
preferably more than four times higher using northern, or western blotting or array 
technique. According to a preferred embodiment, the parent organism is the deposited 
filamentous fungus cell Aspergillus niger CBS 513.88, Aspergillus oryzae ATCC 20423,
IFO 4177, ATCC 1011, ATCC 9576, ATCC 14488-14491, ATCC 11601, ATCC12892,
Penicillium chrysogenum CBS 455.95, Penicillium citrinum ATCC 38065, Penicillium
chrysogenum P2, Acremonium chrysogenum ATCC 36225 or ATCC 48272, Trichoderma
reesei ATCC 26921 or ATCC 56765 or ATCC 26921, Aspergillus sojae ATCC11906,
Chrysosporium lucknowense ATCC44006 or derivatives thereof.

The up regulation of the expression level of at least one DNA sequence may be obtained
by genetic manipulation by one of the following techniques or by a combination thereof:
c. using recombinant genetic manipulation techniques,
d. submitting the filamentous fungus to mutagenesis.

Alternatively or in combination with above-mentioned techniques and according to 
another preferred embodiment, the up regulation of the expression level of at least one 
gene or DNA sequence may be obtained by submitting the filamentous fungus to an 
activating compound/composition. 

The filamentous fungus may be subsequently selected by monitoring the expression 
level of said DNA sequence and optionally the efficiency of the HR pathway of the 
filamentous fungus. The HR efficiency of a filamentous fungus may be measured by the 
efficiency of the targeted integration of a given polynucleotide sequence into a
pre-determined site in the genome of the filamentous fungus using given homology
region(s). 
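
The measurement described above can be sketched, under stated assumptions, as a simple ratio: the fraction of analysed transformants in which the polynucleotide has integrated at the pre-determined site. The function names and the transformant counts below are hypothetical illustrations, not data from the description.

```python
def targeting_efficiency(targeted: int, total: int) -> float:
    """Fraction of analysed transformants with integration at the target site."""
    if total <= 0:
        raise ValueError("total must be positive")
    if not 0 <= targeted <= total:
        raise ValueError("targeted must lie between 0 and total")
    return targeted / total


def fold_improvement(mutant_eff: float, parent_eff: float) -> float:
    """Ratio of mutant to parent targeting efficiency (same homology regions)."""
    if parent_eff <= 0:
        raise ValueError("parent_eff must be positive")
    return mutant_eff / parent_eff


# Hypothetical counts: transformants with correct integration / transformants analysed.
parent = targeting_efficiency(targeted=2, total=100)
mutant = targeting_efficiency(targeted=80, total=100)
print(f"parent: {parent:.0%}, mutant: {mutant:.0%}, "
      f"fold improvement: {fold_improvement(mutant, parent):.1f}")
```

Comparing mutant and parent only makes sense when both are transformed with the same construct and the same homology region(s), as the text requires.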

Preferably, the up regulation of the expression of at least one DNA sequence is made
with recombinant genetic manipulation techniques such as defined in step a. to obtain a
recombinant filamentous fungus. Preferably step a. comprises transforming the
filamentous fungus with a DNA construct comprising the DNA sequence, preferably said
DNA sequence being operationally linked to a promoter of a highly expressed gene. The
chosen promoter may be stronger than the endogenous promoter of the DNA sequence
to be over expressed. The promoter for expression of the DNA sequence is preferably
derived from a highly expressed fungal gene.

A number of preferred highly expressed fungal genes are given by way of example: the
amylase, glucoamylase, alcohol dehydrogenase, xylanase, glyceraldehyde-phosphate
dehydrogenase or cellobiohydrolase genes from Aspergilli or Trichoderma. Most
preferred highly expressed genes for these purposes are an Aspergillus niger
glucoamylase gene, an Aspergillus oryzae TAKA-amylase gene, an Aspergillus nidulans
gpdA gene or a Trichoderma reesei cellobiohydrolase gene. A glucoamylase promoter is
the most preferred promoter to be used. These highly expressed genes are suitable both
as target loci for integration of cloning vectors and as source of highly expressed fungal
genes.

According to another preferred embodiment, step a. comprises increasing the copy
number of the DNA sequence in the filamentous fungal cell, preferably by integrating
into its genome copies of the DNA sequence, more preferably by targeting the 
integration of the DNA sequence at a highly expressed locus, preferably at a 
glucoamylase locus. 

The up regulation of the expression of the DNA sequence may be reached by increasing 
the copy number of the DNA sequence by introducing at least one copy of the DNA 
sequence into the filamentous fungus or by changing for a stronger promoter or 
changing for a gene encoding a protein with better kinetics and/or lifetime. The DNA 
sequence may be present on a plasmid or integrated into the genome. The skilled 
person can choose amongst two alternative possibilities: 

- over express at least one endogenous DNA sequence of the filamentous fungus
being involved in the HR pathway. In this case, the filamentous fungus comprises
several copies of its endogenous DNA sequence.

- over express at least one heterologous DNA involved in HR. In this case, the
filamentous fungus would have its endogenous DNA sequence involved in HR
and in addition at least one copy of a heterologous DNA sequence involved in
HR. This heterologous DNA sequence is a homologue of its corresponding
endogenous DNA sequence.

The filamentous fungus can be transformed with one or more copies of the DNA sequence
(derived from inter alia Tilburn et al., 1983, Gene 26:205-221). The DNA sequence can
be either stably integrated into the genome of the filamentous fungus or introduced into
the cell as part of a DNA molecule capable of autonomous replication. The DNA
sequence is preferably present on a cloning vector. Any cloning vector capable of
transforming a filamentous fungal host cell is suitable for use in the present invention.
Cloning vectors for use in the invention thus comprise integrative cloning vectors, which
integrate at random or at a predetermined target locus in the chromosomes of the
filamentous fungal host cell, as well as autonomously maintained cloning vectors such
as vectors comprising the AMA1-sequence. In a preferred embodiment of the invention,
the integrative cloning vector comprises a DNA fragment which is homologous to a DNA
sequence in a predetermined target locus in the genome of the filamentous fungal host
cell for targeting the integration of the cloning vector to this predetermined locus. In order
to promote targeted integration, the cloning vector is preferably linearized prior to
transformation of the host cell. Linearization is preferably performed such that at least
one but preferably either end of the cloning vector is flanked by sequences homologous
to the target locus. The length of the homologous sequences flanking the target locus is
preferably at least 30 bp, preferably at least 50 bp, preferably at least 0.1 kb, even
preferably at least 0.2 kb, more preferably at least 0.5 kb, even more preferably at least
1 kb, most preferably at least 2 kb.
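
The graded length preferences for the homologous flanking sequences can be illustrated with a short Python sketch. The constant `THRESHOLDS_BP` simply transcribes the lengths stated above (with 1 kb taken as 1000 bp); the function name and the example flank lengths are hypothetical.

```python
from typing import Optional

# Minimum flank lengths from the text, in base pairs, in order of increasing preference.
THRESHOLDS_BP = [30, 50, 100, 200, 500, 1000, 2000]


def highest_threshold_met(flank_bp: int) -> Optional[int]:
    """Return the largest stated minimum length that the flank satisfies, or None."""
    met = [t for t in THRESHOLDS_BP if flank_bp >= t]
    return max(met) if met else None


# Hypothetical flank lengths for a linearized cloning vector.
for length in (25, 120, 1500, 2500):
    print(length, "bp ->", highest_threshold_met(length))
```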

Preferably, the DNA sequence in the cloning vector which is homologous to the target
locus is derived from a highly expressed locus, meaning that it is derived from a gene
which is capable of high expression level in the filamentous fungal host cell. A gene
capable of high expression level, i.e. a highly expressed gene, is herein defined as a
gene whose mRNA can make up at least 0.5% (w/w) of the total cellular mRNA, e.g.
under induced conditions, or alternatively, a gene whose gene product can make up at
least 1% (w/w) of the total cellular protein, or, in case of a secreted gene product, can be
secreted to a level of at least 0.1 g/l (as described in EP 357 127 B1).
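
The three alternative criteria in this definition can be written, purely as an illustrative sketch, as a single predicate. The function and parameter names are hypothetical; the thresholds are those stated above, with the w/w percentages expressed as fractions.

```python
from typing import Optional


def is_highly_expressed(
    mrna_fraction: Optional[float] = None,     # mRNA as w/w fraction of total cellular mRNA
    protein_fraction: Optional[float] = None,  # product as w/w fraction of total cellular protein
    secreted_g_per_l: Optional[float] = None,  # secreted product level in g/l
) -> bool:
    """True if at least one of the three alternative criteria is met."""
    return (
        (mrna_fraction is not None and mrna_fraction >= 0.005)
        or (protein_fraction is not None and protein_fraction >= 0.01)
        or (secreted_g_per_l is not None and secreted_g_per_l >= 0.1)
    )


# Hypothetical measurements, for illustration only.
print(is_highly_expressed(mrna_fraction=0.007))
print(is_highly_expressed(protein_fraction=0.004, secreted_g_per_l=0.05))
```
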
To increase even more the number of copies of the DNA sequence to be over expressed,
the technique of gene conversion as described in WO 98/46772 may be used.

The skilled person will appreciate the possibility that the homologous DNA sequence for 
targeting and the promoter sequence can coincide in one DNA fragment. The list of
highly expressed genes given above is also suited as target locus. 

An example of an autonomously maintained cloning vector is a cloning vector comprising
the AMA1-sequence. AMA1 is a 6.0-kb genomic DNA fragment isolated from Aspergillus
nidulans, which is capable of Autonomous Maintenance in Aspergillus (see e.g.
Aleksenko and Clutterbuck (1997), Fungal Genet. Biol. 21: 373-397).

According to another preferred embodiment of the method of the invention, step a.
comprises transforming the filamentous fungus with a DNA construct comprising a
selection marker gene. The selection marker gene in the cloning vector can be selected
from a number of marker genes that are useful for transformation of filamentous fungi.
By way of example these markers include but are not limited to dominant and
bi-directional selection marker genes such as an amdS gene (EP 635574, WO 97/06261),
auxotrophic marker genes such as argB, trpC or pyrG and antibiotic resistance genes
providing resistance against e.g. phleomycin, hygromycin B or G418. The use of a
dominant and bi-directional selection marker gene is preferred. An amdS gene is
preferred, more preferably an amdS gene from Aspergillus nidulans or
Aspergillus niger. A most preferred selection marker gene is the A.nidulans amdS coding
sequence fused to the A.nidulans gpdA promoter (see EP 635574). AmdS genes from
other filamentous fungi may also be used (WO 97/06261). The amdS selection marker
gene has the advantage that it can be used several times in the same strain to introduce,
over express and/or delete distinct DNA sequences. By means of counterselection on
fluoroacetamide media as described in EP 635574, the resulting strain is marker free and
can be used for further gene modifications.

Alternatively or in addition with above-mentioned techniques, up regulation of the 
expression of a DNA sequence can be reached using UV or chemical mutagenesis 
(Mattern, I.E., van Noort J.M., van den Berg, P., Archer, D. B., Roberts, I.N. and van den
Hondel, C. A., Isolation and characterization of mutants of Aspergillus niger deficient in 
extracellular proteases. Mol Gen Genet. 1992 Aug;234(2):332-6.). 

In addition and/or in combination with up regulation of expression of DNA sequences
involved in HR, it is also possible to obtain an increased HR efficiency by increasing the 
activity of proteins involved in HR by UV or chemical mutagenesis (Mattern, I.E., van 
Noort J.M., van den Berg, P., Archer, D. B., Roberts, I.N. and van den Hondel, C. A., 
Isolation and characterization of mutants of Aspergillus niger deficient in extracellular 
proteases. Mol Gen Genet. 1992 Aug;234(2):332-6.) . 

The skilled person would understand that to achieve the up regulation of the expression 
of a DNA sequence, one may use each of the described techniques either separately or
in combination. 

The skilled person would also understand that to obtain a filamentous fungus with an 
increased HR/NHR ratio, and/or with a lowered NHR efficiency and/or an elevated HR 
efficiency, one may use at least one of each technique described for respectively down 
and up regulating the expression of a given gene in a filamentous fungus. Preferably, all 
the techniques performed on the filamentous fungus to obtain a recombinant filamentous 
fungus having both a lowered NHR efficiency and an elevated HR efficiency have been 
performed using a dominant and bi-directional selection marker, preferably an amdS 
gene, more preferably an amdS gene from Aspergillus nidulans or Aspergillus niger.
The obtained filamentous fungus may be subsequently selected by monitoring the
expression level of said DNA sequence as described before by using for example
northern and/or western blotting and/or array and/or phenotype screening. Optionally,
the efficiency of the NHR and/or HR pathways of the cell is monitored. The efficiency of
these pathways of a filamentous fungus may be monitored as defined earlier on.

Preferably, the modification brought in the HR pathway is an inducible one. This can be
reached by replacing the endogenous regulatory regions of the gene encoding the
component involved in HR by inducible regulatory regions, preferably by using an
inducible promoter. Examples of inducible promoters are the glucoamylase promoter of
Aspergillus niger, the TAKA amylase promoter of Aspergillus oryzae, the PAF promoter
(Marx, F., Haas, H., Reindl, M., Stoffler, G., Lottspeich, F. and Redl, B. Cloning, structural
organization and regulation of expression of the Penicillium chrysogenum paf gene
encoding an abundantly secreted protein with antifungal activity. Gene 167 (1-2),
167-171 (1995)) or the pcbC promoter of Penicillium chrysogenum (Martin JF, Casqueiro J,
Kosalkova K, Marcos AT, Gutierrez S. Penicillin and cephalosporin biosynthesis:
mechanism of carbon catabolite regulation of penicillin production. Antonie Van
Leeuwenhoek. 1999 Jan-Feb;75(1-2):21-31. Review.) or the switch on/off systems
earlier cited for down regulation of the expression of genes involved in NHR.

According to a preferred embodiment, the genes involved in the HR pathway, which are 
modified are the following genes or homologues thereof: rad51, rad52.

All kinds of mutants having at least one component involved in HR, which is more
capable or at least significantly more capable to perform its function in the process of HR
are mutants contemplated by the present invention. Preferably, the activity of the
components involved in HR has been modified so that the efficiency of the HR pathway
is more than 110% of the efficiency in the parent cell it originates from under the same
conditions as measured in the assay defined earlier, more preferably more than 200%,
most preferably more than 500%. According to a preferred embodiment, the parent
organism is the filamentous fungus cell Aspergillus niger CBS 513.88, Aspergillus oryzae
ATCC 20423, IFO 4177, ATCC 1011, ATCC 9576, ATCC 14488-14491, ATCC 11601,
ATCC12892, Penicillium chrysogenum CBS 455.95, or Penicillium citrinum ATCC
38065, Penicillium chrysogenum P2, Acremonium chrysogenum ATCC 36225 or ATCC
48272, Trichoderma reesei ATCC 26921 or ATCC 56765 or ATCC 26921, Aspergillus
sojae ATCC11906, Chrysosporium lucknowense ATCC44006 or derivatives thereof.

Methods according to the present invention, as extensively but not limitingly discussed
above, can be used in a wide variety of applications. Some specific applications are
described below.

According to a preferred embodiment, there is provided a method which comprises at
least the steps of introducing a polynucleotide of interest into a given filamentous fungus,
for example by the process of transformation or electroporation, and integration of said
polynucleotide in the genetic material of said cell. Integration is a complex process
wherein a nucleic acid sequence becomes part of the genetic material of a host cell. One
step in the process of nucleic acid integration is recombination; via recombination nucleic
acid sequences are exchanged or inserted and the introduced nucleic acid becomes part
of the genetic material of a host cell. In principle two different ways of recombination are
possible: homologous and illegitimate or NHR. Most (higher) eukaryotes do not or at
least not significantly practice HR although the essential proteins to accomplish such a
process are available. One reason for this phenomenon is that frequent use of
homologous recombination in (higher) eukaryotes could lead to undesirable
chromosomal rearrangements due to the presence of repetitive nucleic acid sequences.
To accomplish HR via a method according to the invention, it is important to provide a
polynucleotide which has homology with a predetermined site. It is clear to a person
skilled in the art that the percentage of homology and the length of (a) homologous
region(s) play(s) an important role in the process of homologous recombination. The
percentage of homology is preferably close to 100%. A person skilled in the art is aware
of the fact that lower percentages of homology are also used in the field of homologous
recombination, but dependent on, for example, the regions of homology and their overall
distribution, can lead to a lower efficiency of HR but are still useful and therefore
included in the present invention. Furthermore, the length of a (nearly) homologous
region is approximately 3 kb, which is sufficient to direct homologous recombination. At
least one homologous region is necessary for recombination but more preferably 2
homologous regions flanking the nucleic acid of interest are used for targeted
integration. The researcher skilled in the art knows how to select the proper percentage
of homology, the length of homology and the number of homologous regions. By
providing such a homology a nucleic acid is integrated at every desired position within
the genetic material of a host cell. It is clear to a person skilled in the art that the
invention as disclosed herein is used to direct any nucleic acid (preferably DNA) to any
pre-determined site as long as the length of homology and percentage of homology are
high enough to provide/enable HR.

Before the present invention was made, a polynucleotide could not always easily
be integrated at every desired position into the genome of a given filamentous fungus.
The method according to the invention is applied, for example, to affect the gene
function in various ways, not only for complete inactivation but also to mediate changes
in the expression level or in the regulation of expression, changes in protein activity or
the subcellular targeting of an encoded protein. Complete inactivation, which can usually
not be accomplished by existing methods such as antisense technology or RNAi
technology (Zrenner R, Willmitzer L, Sonnewald U. Analysis of the expression of potato
uridinediphosphate-glucose pyrophosphorylase and its inhibition by antisense RNA.
Planta. (1993); 190(2): 247-52.), is useful for instance for the inactivation of genes
controlling undesired side branches of metabolic pathways, for instance to increase the
production of specific secondary metabolites such as (beta-lactam) antibiotics or
carotenoids. Complete inactivation is also useful to reduce the production of toxic or
unwanted compounds (chrysogenin in Penicillium; aflatoxin in Aspergillus: MacDonald
KD et al.: Heterokaryon studies and the genetic control of penicillin and chrysogenin
production in Penicillium chrysogenum. J Gen Microbiol. (1963) 33:375-83). Complete
inactivation is also useful to alter the morphology of the organism in such a way that the
fermentation process and down stream processing is improved.

The invention allows existing regulatory sequences to be replaced by alternative regulatory
sequences to alter expression of endogenous genes (e.g. expression in response to
specific inducers).

One embodiment of the present invention is the replacement of an active gene by an
inactive gene according to a method of the invention. Complete inactivation, which can
usually not be accomplished by existing methods such as antisense technology or RNAi
technology, is useful for instance for the inactivation of genes controlling undesired side
branches of metabolic pathways, for instance to increase the quality of bulk products
such as starch, or to increase the production of specific secondary metabolites or to
inhibit formation of unwanted metabolites.

Another embodiment of the invention relates to the extensive metabolic reprogramming
or engineering of a filamentous fungal cell. Introduction of completely new pathways
and/or modification of unwanted pathways will lead to the obtention of a cell specifically
adapted for the production of a specific compound such as a protein or a metabolite. 

Another embodiment of the present invention is the replacement of an inactive or altered 
gene by an active gene. For example, after successive rounds of classical mutagenesis, 
it often occurs that the selected filamentous fungal strain has some endogenous genes
altered or even inactivated during the random mutagenesis process. 

In yet another embodiment the invention provides a method to introduce a substance 
conferring resistance for an antibiotic substance to a filamentous fungal cell according to 
a method of the invention. Also a method according to the invention is used to confer a 
desired property to a filamentous fungal cell. In a preferred embodiment a gene delivery 
vehicle is used to deliver a desired polynucleotide to a predetermined site. Gene delivery 
vehicles are well known in the art and have been earlier described in the description. 

Also another preferred method according to the invention is used to effectuate
predictable expression of transgenes encoding novel products, for example by replacing 
existing coding sequences of genes giving a desired expression profile by those for a 
desired novel product. According to a more preferred embodiment, the filamentous 
fungus provided by the invention further comprises a DNA construct comprising a 
desired gene coding for a desired protein to be produced. 

Preferably, the desired gene encoding the desired protein to be produced is inserted into
an expression vector, which is subsequently used to transform the obtained host cell. In 
the expression vector, the DNA sequence may be operationally linked to appropriate 
expression signals, such as a promoter, optionally a signal sequence and a terminator, 
which are capable of directing the expression and synthesis of the protein in the host 
organism.

More preferably, the desired gene is operationally linked to a promoter and to a 
secretion signal. The strategy, which can be used to express the desired gene is the 
same as the one described under the section up regulation of the expression of a DNA 
sequence, whose expression product is involved in HR: increasing copy number,
targeting integration, use of a promoter of a highly expressed gene, choice of the
selection marker gene and combinations thereof.

The desired protein is preferably an enzyme. If the protein is not naturally secreted, the
polynucleotide encoding the protein may be modified to have a signal sequence in
accordance with techniques known in the art. The proteins which are secreted may be
endogenous proteins which are expressed naturally, but can also be heterologous.
Heterologous means that the protein encoded by the gene is not produced under native
conditions in the wild type filamentous fungus. Examples of enzymes which may be
produced by the filamentous fungi of the invention are carbohydrases, e.g. cellulases
such as endoglucanases, β-glucanases, cellobiohydrolases or β-glucosidases,
hemicellulases or pectinolytic enzymes such as xylanases, xylosidases, mannanases,
galactanases, galactosidases, rhamnogalacturonases, arabanases, galacturonases,
lyases, or amylolytic enzymes; phosphatases such as phytases, esterases such as
lipases, proteolytic enzymes, oxidoreductases such as oxidases, transferases, or
isomerases. More preferably, the desired gene encodes a phytase.

As another example, existing coding sequences are modified so that the protein encoded
has optimized characteristics, for instance to make a protein with improved thermal
characteristics and/or improved kinetic properties (Km, Kcat), and/or improved enzyme
stability, and/or extended substrate range, and/or increased life span, etc.



The present invention further relates to the filamentous fungus, which is used in all the 
methods and applications defined previously. 

The filamentous fungus mutant preferably used in the method of the invention is a
mutant originating from a parent cell, wherein the ratio of NHR/HR is decreased and/or
wherein the efficiency of the NHR pathway has been lowered and/or the efficiency of the 
HR pathway has been elevated in said mutant cell as compared to said ratio and said 
efficiencies in said parent organism under the same conditions. 

According to a preferred embodiment, the parent organism is the deposited filamentous
fungus cell Aspergillus niger CBS 513.88, Aspergillus oryzae ATCC 20423, IFO 4177,
ATCC 1011, ATCC 9576, ATCC 14488-14491, ATCC 11601, ATCC12892, Penicillium
chrysogenum CBS 455.95 or Penicillium citrinum ATCC 38065, Penicillium
chrysogenum P2, Acremonium chrysogenum ATCC 36225 or ATCC 48272, Trichoderma
reesei ATCC 26921 or ATCC 56765 or ATCC 26921, Aspergillus sojae ATCC11906,
Chrysosporium lucknowense ATCC44006 or derivatives thereof.

Preferably, the mutant is deficient in a gene encoding a component involved in NHR, 
and/or has a decreased level of a component involved in NHR.

Even more preferably, the mutant is deficient in at least one of the following genes: hdfA
or homologues thereof as identified in SEQ ID NO 2, hdfB or homologues thereof as
identified in SEQ ID NO 5, or both, and/or has preferably a decreased amount of at least
one of the proteins encoded by these genes.

Most preferably, the mutant is inducibly deficient in at least one of the following genes:
hdfA or homologues thereof as identified in SEQ ID NO 2, hdfB or homologues thereof
as identified in SEQ ID NO 5, or both, and/or has, preferably inducibly, a decreased
amount of at least one of the proteins encoded by these genes.

According to another preferred embodiment, the mutant is such that in its genome, a 
gene involved in NHR has been replaced by a non-functional gene or by a selection
marker or by another gene. 

According to another preferred embodiment, the mutant has an increased level of a 
component involved in HR. 

The filamentous fungus according to the invention may have been obtained by molecular
biology techniques. A filamentous fungus obtained by such a genetic engineering
approach is defined as a recombinant filamentous fungus. However, a recombinant
filamentous fungus in the context of the invention could have been subjected earlier in
time to a mutagenesis technique to reach another wanted effect. According to a most
preferred embodiment, the filamentous fungus obtained is a recombinant filamentous
fungus.

The host cell of the present invention is a filamentous fungus, which is capable of being
transformed with a cloning vector. For most filamentous fungi tested thus far it was found
that they could be transformed using transformation protocols developed for Aspergillus
(derived from inter alia Tilburn et al. 1983, Gene 26:205-221). The skilled person will
recognise that successful transformation of the filamentous fungal host species is not
limited to the use of vectors, selection marker systems, promoters and transformation
protocols specifically exemplified herein.

A filamentous fungus is herein defined as a eukaryotic micro-organism of the subdivision
Eumycotina in filamentous form, i.e. the vegetative growth of which occurs by hyphal
elongation. Preferred filamentous fungal host cells are selected from the group
consisting of the genera Aspergillus, Trichoderma, Fusarium, Penicillium, and
Acremonium.

In a more preferred embodiment of the invention, the filamentous fungal host cell is
selected from the group consisting of A. nidulans, A. oryzae, A. sojae, Aspergilli of the
A. niger group, Trichoderma reesei and Penicillium chrysogenum. Preferably the
Penicillium is a Penicillium chrysogenum or Penicillium citrinum species.

The A. niger group is herein defined according to Raper and Fennell (1965, In: The
Genus Aspergillus, The Williams & Wilkins Company, Baltimore, pp 293-344) and
comprises all (black) Aspergilli therein included by these authors. Most preferred
filamentous fungal host cells are selected from the group consisting of Aspergilli of the
A. niger group, A. oryzae, Trichoderma reesei and Penicillium chrysogenum.

The invention further relates to the use of such filamentous fungus for producing a
polypeptide of interest. Alternatively, the filamentous fungus obtained may be used for
producing a secondary metabolite. Preferred secondary metabolites are carotenoid
compounds, beta-lactam compounds, drugs, anti-tumor compounds, etc.

According to a further aspect of the invention, there are provided the following isolated
cDNA sequences having SEQ ID NO 2 (hdfA) or 5 (hdfB) or homologues thereof. SEQ
ID NO 1 and 4 correspond respectively to the genomic DNA and SEQ ID NO 3 and 6 to
the protein encoded by the respective cDNA sequence.

The sequence information as provided herein should not be so narrowly construed as to
require inclusion of erroneously identified bases. The specific sequences disclosed
herein can be readily used to isolate the complete gene from filamentous fungi, in
particular A. niger, which in turn can easily be subjected to further sequence analyses
thereby identifying sequencing errors.

Unless otherwise indicated, all nucleotide sequences determined by sequencing a DNA
molecule herein were determined using an automated DNA sequencer and all amino
acid sequences of polypeptides encoded by DNA molecules determined herein were
predicted by translation of a DNA sequence determined as above. Therefore, as is
known in the art for any DNA sequence determined by this automated approach, any
nucleotide sequence determined herein may contain some errors. Nucleotide sequences
determined by automation are typically at least about 90% identical, more typically at
least about 95% to at least about 99.9% identical to the actual nucleotide sequence of
the sequenced DNA molecule. The actual sequence can be more precisely determined 
by other approaches including manual DNA sequencing methods well known in the art. 
As is also known in the art, a single insertion or deletion in a determined nucleotide 
sequence compared to the actual sequence will cause a frame shift in translation of the 
nucleotide sequence such that the predicted amino acid sequence encoded by a 
determined nucleotide sequence will be completely different from the amino acid 
sequence actually encoded by the sequenced DNA molecule, beginning at the point of 
such an insertion or deletion. 
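
The frame shift effect described above can be illustrated with a minimal Python sketch. It assumes the standard genetic code and uses a hypothetical 12-nucleotide sequence that is not taken from SEQ ID NO 2 or 5; the function name `translate` is likewise only illustrative.

```python
# Standard genetic code, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate a DNA string in frame 1, ignoring an incomplete trailing codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# Hypothetical "actual" sequence (assumption, for illustration only).
actual = "ATGGCCAAGTTT"
# Simulated sequencing error: one spurious C inserted after the first codon.
determined = actual[:3] + "C" + actual[3:]

print(translate(actual))      # MAKF
print(translate(determined))  # MRQV
```

In this sketch the two predicted amino acid sequences agree only up to the point of the insertion, after which every residue differs.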

The person skilled in the art is capable of identifying such erroneously identified bases
and knows how to correct for such errors.

"Homologous" is defined below. Homologous can be understood as meaning derived
from a filamentous fungus other than Aspergillus niger.

Full length DNA from other organisms can be obtained in a typical approach by screening
cDNA or genomic DNA libraries constructed from other organisms, e.g. filamentous
fungi, in particular from the species Aspergillus.

The invention also encompasses paralogues of hdfA and/or hdfB. In the context of the
invention, paralogues means DNA sequences homologous to SEQ ID NO: 1 or SEQ ID
NO: 4 and derived from A. niger.

For example, Aspergillus strains can be screened for homologous hdfA and/or hdfB
polynucleotides by Northern blot analysis. Upon detection of transcripts homologous to
polynucleotides according to the invention, cDNA libraries can be constructed from RNA
isolated from the appropriate strain, utilizing standard techniques well known to those of
skill in the art. Alternatively, a total genomic DNA library can be screened using a probe
hybridisable to an hdfA and/or hdfB polynucleotide according to the invention.

Homologous gene sequences can be isolated, for example, by performing PCR using
two degenerate oligonucleotide primer pools designed on the basis of nucleotide
sequences as taught herein.
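
As an illustration of what a degenerate primer pool implies, the following Python sketch counts how many distinct oligonucleotides are needed to cover every possible coding sequence of a short peptide. It assumes the standard genetic code and that a pool is designed by back-translating a conserved peptide stretch; the peptides shown are hypothetical and are not taken from SEQ ID NO 3 or 6.

```python
from collections import Counter
from math import prod

# One letter per codon of the standard genetic code ('*' marks stop codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
# Number of codons that encode each amino acid.
CODONS_PER_RESIDUE = Counter(AMINO_ACIDS)


def pool_degeneracy(peptide):
    """Number of distinct coding sequences for a peptide (one-letter codes)."""
    return prod(CODONS_PER_RESIDUE[residue] for residue in peptide)


# Hypothetical peptide stretches (assumptions, for illustration only).
print(pool_degeneracy("MWKDE"))  # 8
print(pool_degeneracy("LSR"))    # 216
```

Under these assumptions, stretches rich in methionine and tryptophan give small pools, whereas stretches rich in leucine, serine or arginine give large ones.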

The template for the reaction can be cDNA obtained by reverse transcription of mRNA
prepared from strains known or suspected to express a polynucleotide according to the
invention. The PCR product can be subcloned and sequenced to ensure that the
amplified sequences represent the sequences of a new hdfA and/or hdfB nucleic acid
sequence, or a functional equivalent thereof.

The PCR fragment can then be used to isolate a full length cDNA clone by a variety of 
known methods. For example, the amplified fragment can be labeled and used to screen 
a bacteriophage or cosmid cDNA library. Alternatively, the labeled fragment can be used 
to screen a genomic library. 

PCR technology also can be used to isolate full length cDNA sequences from other
organisms. For example, RNA can be isolated, following standard procedures, from an
appropriate cellular or tissue source. A reverse transcription reaction can be performed
on the RNA using an oligonucleotide primer specific for the most 5' end of the amplified
fragment for the priming of first strand synthesis.



The resulting RNA/DNA hybrid can then be "tailed" (e.g., with guanines) using a
standard terminal transferase reaction, the hybrid can be digested with RNase H, and
second strand synthesis can then be primed (e.g., with a poly-C primer). Thus, cDNA
sequences upstream of the amplified fragment can easily be isolated. For a review of
useful cloning strategies, see e.g. Sambrook et al., supra; and Ausubel et al., supra.

"Homologous" can also be understood as meaning functional equivalents.

The terms "functional equivalents" and "functional variants" are used interchangeably
herein. Functional equivalents of hdfA and/or hdfB DNA are isolated DNA fragments that
encode a polypeptide that exhibits a particular function of the hdfA and/or hdfB. A
functional equivalent of an hdfA and/or hdfB polypeptide according to the invention is a
polypeptide that exhibits at least one function as part of the NHR complex. Functional
equivalents therefore also encompass biologically active fragments.

Functional protein or polypeptide equivalents may contain only conservative 
substitutions of one or more amino acids of sequences having SEQ ID NO 3 or 6 or 
substitutions, insertions or deletions of non-essential amino acids. Accordingly, a non- 
essential amino acid is a residue that can be altered in one of these sequences without 
substantially altering the biological function. For example, amino acid residues that are 
conserved among the hdfA and/or hdfB proteins of the present invention, are predicted 
to be particularly unamenable to alteration. Furthermore, amino acids conserved among 
the hdfA and/or hdfB proteins according to the present invention are not likely to be 
amenable to alteration. 

The term "conservative substitution" is intended to mean a substitution in which the
amino acid residue is replaced with an amino acid residue having a similar side chain.
These families are known in the art and include amino acids with basic side chains
(e.g. lysine, arginine and histidine), acidic side chains (e.g. aspartic acid, glutamic acid),
uncharged polar side chains (e.g. glycine, asparagine, glutamine, serine, threonine,
tyrosine, cysteine), non-polar side chains (e.g. alanine, valine, leucine, isoleucine,
proline, phenylalanine, methionine, tryptophan), beta-branched side chains (e.g.
threonine, valine, isoleucine) and aromatic side chains (e.g. tyrosine, phenylalanine,
tryptophan, histidine).
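
The side-chain families listed above can be expressed as a simple lookup. The following Python sketch is one possible reading, assuming that a substitution counts as conservative when both residues share at least one of the listed families; the function name `is_conservative` and the one-letter encoding are illustrative choices, not part of the disclosure.

```python
# Side-chain families as listed in the text, in one-letter amino acid codes.
SIDE_CHAIN_FAMILIES = {
    "basic": set("KRH"),               # lysine, arginine, histidine
    "acidic": set("DE"),               # aspartic acid, glutamic acid
    "uncharged polar": set("GNQSTYC"),
    "non-polar": set("AVLIPFMW"),
    "beta-branched": set("TVI"),
    "aromatic": set("YFWH"),
}


def is_conservative(original, replacement):
    """Return True if both residues belong to at least one common family."""
    return any(
        original in members and replacement in members
        for members in SIDE_CHAIN_FAMILIES.values()
    )


print(is_conservative("K", "R"))  # True: both basic
print(is_conservative("T", "V"))  # True: both beta-branched
print(is_conservative("K", "D"))  # False: basic versus acidic
```

Because some residues appear in more than one family (for example threonine and histidine), membership in any shared family is treated as sufficient in this sketch.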

Functional nucleic acid equivalents may typically contain silent mutations or mutations
that do not alter the biological function of the encoded polypeptide. Accordingly, the
invention provides nucleic acid molecules encoding hdfA and/or hdfB proteins that
contain changes in amino acid residues that are not essential for a particular biological
activity. Such hdfA and/or hdfB proteins differ in amino acid sequence from SEQ ID NO 3
or 6, yet retain at least one of their biological activities. In one embodiment the isolated
nucleic acid molecule comprises a nucleotide sequence encoding a protein, wherein the
protein comprises a substantially homologous amino acid sequence that is at least about
60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%, 99% or more
homologous to the amino acid sequence shown in SEQ ID NO 3 or 6. For example,
guidance concerning how to make phenotypically silent amino acid substitutions is
provided in Bowie, J.U. et al., Science 247:1306-1310 (1990) wherein the authors
indicate that there are two main approaches for studying the tolerance of an amino acid 
sequence to change. The first method relies on the process of evolution, in which 
mutations are either accepted or rejected by natural selection. The second approach 
uses genetic engineering to introduce amino acid changes at specific positions of a 
cloned gene and selects or screens to identify sequences that maintain functionality. As 
the authors state, these studies have revealed that proteins are surprisingly tolerant of 
amino acid substitutions. The authors further indicate which changes are likely to be 
permissive at a certain position of the protein. For example, most buried amino acid 
residues require non-polar side chains, whereas few features of surface side chains are 
generally conserved. Other such phenotypically silent substitutions are described in 
Bowie et al., supra, and the references cited therein.

An isolated nucleic acid molecule encoding an hdfA and/or hdfB protein homologous to 
the protein according to SEQ ID NO 3 or 6 can be created by introducing one or more 
nucleotide substitutions, additions or deletions into the coding nucleotide sequences 
according to SEQ ID NO 2 or SEQ ID NO 5, such that one or more amino acid 
substitutions, deletions or insertions are introduced into the encoded protein. Such 
mutations may be introduced by standard techniques, such as site-directed mutagenesis 
and PCR-mediated mutagenesis. 



The term "functional equivalents" also encompasses orthologues of the A. niger hdfA 
and/or hdfB protein. Orthologues of the A. niger hdfA and/or hdfB protein are proteins 
that can be isolated from other strains or species and possess a similar or identical 
biological activity. Such orthologues can readily be identified as comprising an amino 
acid sequence that is substantially homologous to SEQ ID NO 3 or 6. 

"Homologous" can also be understood as meaning "substantially homologous". 
The term "substantially homologous" refers to a first amino acid or nucleotide sequence 
which contains a sufficient or minimum number of identical or equivalent (e.g., with 
similar side chain) amino acids or nucleotides to a second amino acid or nucleotide 
sequence such that the first and the second amino acid or nucleotide sequences have a 
common domain. For example, amino acid or nucleotide sequences which contain a 
common domain having about 45%, preferably about 50%, preferably about 60%,
preferably about 65%, more preferably about 70%, even more preferably about 75%,
80%, 85%, 90%, 95%, 96%, 97%, 98% or 99% identity or more are defined herein as
sufficiently identical.
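
The percentage thresholds above presuppose a way of computing identity. The following Python sketch shows one common convention, assuming the two sequences have already been aligned to equal length with "-" as the gap character and that gap columns are excluded from the count. Other conventions exist, and "equivalent" residues are not credited here. The sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent identity over the ungapped columns of a pairwise alignment."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for x, y in zip(aligned_a, aligned_b):
        if x == "-" or y == "-":
            continue  # skip gap columns (an assumption of this sketch)
        compared += 1
        if x == y:
            matches += 1
    return 100.0 * matches / compared if compared else 0.0


# Hypothetical aligned peptides (assumptions, for illustration only).
print(percent_identity("MKT-LV", "MRTALV"))  # 80.0
```

Here four of the five ungapped columns are identical, so the pair would meet the 75% threshold but not the 85% threshold.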

Also, nucleic acids encoding other hdfA and/or hdfB family members, which thus have a
nucleotide sequence that differs from SEQ ID NO 2 and 5, are within the scope of the
invention. Moreover, nucleic acids encoding hdfA and/or hdfB proteins from different
species, which thus have a nucleotide sequence which differs from SEQ ID NO 2 and 5,
are within the scope of the invention.
Nucleic acid molecules corresponding to variants (e.g. natural allelic variants) and
homologues of the hdfA and/or hdfB DNA of the invention can be isolated based on their
homology to the hdfA and/or hdfB nucleic acids disclosed herein using the cDNAs
disclosed herein or a suitable fragment thereof, as a hybridisation probe according to
standard hybridisation techniques preferably under highly stringent hybridisation
conditions.

"Stringency" of hybridization reactions is readily determinable by one of ordinary skill in 
the art, and generally is an empirical calculation dependent upon probe length, washing
temperature, and salt concentration. In general, longer probes require higher
temperatures for proper annealing, while shorter probes need lower temperatures.
Hybridization generally depends on the ability of denatured DNA to reanneal when
complementary strands are present in an environment below their melting temperature.
The higher the degree of desired homology between the probe and hybridizable
sequence, the higher the relative temperature, which can be used. As a result, it follows 
that higher relative temperatures would tend to make the reaction conditions more 
stringent, while lower temperatures less so. 

For additional details and explanation of stringency of hybridization reactions, see
Ausubel et al., Current Protocols in Molecular Biology, Wiley Interscience Publishers,
(1995).

"Stringent conditions" or "high stringency conditions", as defined herein, may be
identified by those that: (1) employ low ionic strength and high temperature for washing,
for example 0.015 M sodium chloride/0.0015 M sodium citrate/0.1% sodium dodecyl
sulfate at 50°C; (2) employ during hybridization a denaturing agent, such as formamide,
for example, 50% (v/v) formamide with 0.1% bovine serum albumin/0.1% Ficoll/0.1%
polyvinylpyrrolidone/50 mM sodium phosphate buffer at pH 6.5 with 750 mM sodium
chloride, 75 mM sodium citrate at 42°C; or (3) employ 50% formamide, 5 x SSC (0.75 M
NaCl, 0.075 M sodium citrate), 50 mM sodium phosphate (pH 6.8), 0.1% sodium
pyrophosphate, 5 x Denhardt's solution, sonicated salmon sperm DNA (50 µg/ml), 0.1%
SDS, and 10% dextran sulfate at 42°C, with washes at 42°C in 0.2 x SSC (sodium
chloride/sodium citrate) and 50% formamide at 55°C, followed by a high-stringency wash
consisting of 0.1 x SSC containing EDTA at 55°C.
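
The SSC strengths named above can be converted to salt concentrations with simple arithmetic. The following Python sketch derives 1 x SSC from the stated 5 x SSC composition (0.75 M NaCl, 0.075 M sodium citrate); the function name `ssc_molarity` is an illustrative choice.

```python
# 1 x SSC, derived from the text's 5 x SSC = 0.75 M NaCl, 0.075 M sodium citrate.
SSC_1X_MOLAR = {"sodium chloride": 0.15, "sodium citrate": 0.015}


def ssc_molarity(fold):
    """Molar salt concentrations of an SSC solution at the given strength."""
    return {name: round(molar * fold, 6) for name, molar in SSC_1X_MOLAR.items()}


for strength in (5, 0.2, 0.1):
    print(strength, ssc_molarity(strength))
```

On this arithmetic the wash in condition (1), 0.015 M sodium chloride/0.0015 M sodium citrate, corresponds to 0.1 x SSC, the same ionic strength as the final high-stringency wash in condition (3).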

"Moderately stringent conditions" may be identified as described by Sambrook et al.,
Molecular Cloning: A Laboratory Manual, New York: Cold Spring Harbor Press, 1989,
and include the use of washing solution and hybridization conditions (e.g., temperature,
ionic strength and % SDS) less stringent than those described above. An example of
moderately stringent conditions is overnight incubation at 37°C in a solution comprising:
20% formamide, 5 x SSC (1 x SSC being 150 mM NaCl, 15 mM trisodium citrate), 50 mM
sodium phosphate (pH 7.6), 5 x Denhardt's solution, 10% dextran sulfate, and 20 mg/ml
denatured sheared salmon sperm DNA, followed by washing the filters in 1 x SSC at
about 37-50°C. The skilled artisan will recognize how to adjust the temperature, ionic
strength, etc. as necessary to accommodate factors such as probe length and the like,
or by using an algorithm suitable for determining sequence similarity.

Homologous (similar or identical) sequences can also be determined by using a
"sequence comparison algorithm". Optimal alignment of sequences for comparison can
be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl.
Math. 2: 482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J.
Mol. Biol. 48: 443 (1970), by the search for similarity method of Pearson & Lipman, Proc.
Natl. Acad. Sci. USA 85: 2444 (1988), by computerized implementations of these
algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software
Package, Genetics Computer Group, 575 Science Dr., Madison, WI), or by visual
inspection. An example of an algorithm that is suitable for determining sequence
similarity is the BLAST algorithm, which is described in Altschul et al., J. Mol. Biol. 215:
403-410 (1990).

Software for performing BLAST analyses is publicly available through the National
Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). This algorithm
involves first identifying high scoring sequence pairs (HSPs) by identifying short words of
length W in the query sequence that either match or satisfy some positive-valued
threshold score T when aligned with a word of the same length in a database sequence.
These initial neighborhood word hits act as starting points to find longer HSPs containing 
them. The word hits are expanded in both directions along each of the two sequences 
being compared for as far as the cumulative alignment score can be increased. 
Extension of the word hits is stopped when: the cumulative alignment score falls off by 
the quantity X from a maximum achieved value; the cumulative score goes to zero or 
below; or the end of either sequence is reached. 
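
The three stopping rules for word-hit extension can be sketched in Python as follows. This is a simplified, ungapped, rightward-only illustration and not the BLAST implementation itself. It assumes the reward and penalty values M=5 and N=-4 apply to matching and mismatching nucleotides, and uses a hypothetical drop-off value X of 10; the sequences are likewise hypothetical.

```python
def extend_right(query, subject, q_start, s_start, match=5, mismatch=-4, x_drop=10):
    """Ungapped rightward extension of a word hit.

    Returns (best_score, best_length) for the highest-scoring extension found.
    """
    score = 0
    best_score = 0
    best_length = 0
    length = 0
    # Rule 3: stop when the end of either sequence is reached.
    while q_start + length < len(query) and s_start + length < len(subject):
        same = query[q_start + length] == subject[s_start + length]
        score += match if same else mismatch
        length += 1
        if score > best_score:
            best_score, best_length = score, length
        # Rule 1: the score has fallen off by X from its maximum.
        # Rule 2: the cumulative score has gone to zero or below.
        if best_score - score >= x_drop or score <= 0:
            break
    return best_score, best_length


# Hypothetical sequences sharing their first eight nucleotides.
print(extend_right("ACGTACGTTTTT", "ACGTACGTAAAA", 0, 0))  # (40, 8)
```

In this example the score peaks at 40 after eight matches and the extension is abandoned after three mismatches, once the score has dropped 12 below the maximum.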

The BLAST algorithm parameters W, T, and X determine the sensitivity and speed of the
alignment. The BLAST program uses as defaults a wordlength (W) of 11, the
BLOSUM62 scoring matrix (see Henikoff & Henikoff, Proc. Natl. Acad. Sci. USA 89:
10915 (1992)), alignments (B) of 50, expectation (E) of 10, M=5, N=-4, and a comparison
of both strands.

The BLAST algorithm then performs a statistical analysis of the similarity between two 
sequences (see, e.g., Karlin & Altschul, Proc. Natl. Acad. Sci. USA 90: 5873-5877
(1993)). One measure of similarity provided by the BLAST algorithm is the smallest sum
probability (P (N)), which provides an indication of the probability by which a match 
between two nucleotide or amino acid sequences would occur by chance. For example, 
an amino acid sequence is considered similar to a protein such as a protease if the 
smallest sum probability in a comparison of the test amino acid sequence to a protein 
such as a protease amino acid sequence is less than about 0.1, more preferably less 
than about 0.01, and most preferably less than about 0.001. Preferably the similarity is 
at least 40% homology to one of the DNA sequences having SEQ ID NO 2 and 5. More
preferably the similarity is at least 50%, more preferably at least 60%, more preferably
at least 70%, more preferably at least 80%, more preferably at least 90%.

In addition to naturally occurring allelic variants of the hdfA and/or hdfB sequence, the
skilled person will recognise that changes can be introduced by mutation into the
nucleotide sequences of SEQ ID NO 2 and 5 thereby leading to changes in the amino
acid sequence of the hdfA and/or hdfB protein without substantially altering the function
of the hdfA and/or hdfB protein.

In another aspect of the invention, deteriorated hdfA and/or hdfB proteins are provided.
Deteriorated hdfA and/or hdfB proteins are proteins wherein at least one biological
activity is decreased. Such proteins may be obtained by randomly introducing mutations
along all or part of the hdfA and/or hdfB coding sequence, such as by saturation
mutagenesis, and the resulting mutants can be expressed recombinantly and screened
for biological activity. For instance, the art provides for standard assays for measuring
their enzymatic activity and thus deteriorated proteins may easily be selected.
Preferably, the assay is the one described earlier on (see for example WO02/052026
page 23 or the phenotypic screening assay).

In a preferred embodiment, the hdfA and/or hdfB protein has an amino acid sequence
according to SEQ ID NO 3 or 6. In another embodiment, the hdfA and/or hdfB
polypeptide is substantially homologous to the amino acid sequence according to SEQ
ID NO 3 and 6 and retains at least one biological activity of a polypeptide according to
SEQ ID NO 3 and 6, yet differs in amino acid sequence due to natural variation or
mutagenesis as described above.

In a further preferred embodiment, the hdfA and/or hdfB protein has an amino acid
sequence encoded by an isolated nucleic acid fragment capable of hybridising to a
nucleic acid according to SEQ ID NO 2 and 5, preferably under highly stringent
hybridisation conditions.

Accordingly, the hdfA and/or hdfB protein is a protein which comprises an amino acid
sequence at least about 60%, 65%, 70%, 75%, 80%, 85%, 90%, 95%, 96%, 97%, 98%,
99% or more homologous to the amino acid sequence shown in SEQ ID NO 3 and 6 and
retains at least one functional activity of the polypeptide according to SEQ ID NO 3 and
6.

Functional equivalents of a protein according to the invention can also be identified e.g.
by screening combinatorial libraries of mutants, e.g. truncation mutants, of the protein of
the invention for a given activity. In one embodiment, a variegated library of variants is
generated by combinatorial mutagenesis at the nucleic acid level. A variegated library of
variants can be produced by, for example, enzymatically ligating a mixture of synthetic
oligonucleotides into gene sequences such that a degenerate set of potential protein
sequences is expressible as individual polypeptides, or alternatively, as a set of larger
fusion proteins (e.g., for phage display). There are a variety of methods that can be used
to produce libraries of potential variants of the polypeptides of the invention from a
degenerate oligonucleotide sequence. Methods for synthesizing degenerate
oligonucleotides are known in the art (see, e.g., Narang (1983) Tetrahedron 39:3; Itakura
et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science 198:1056; Ike et
al. (1983) Nucleic Acid Res. 11:477).

In addition, libraries of fragments of the coding sequence of a polypeptide of the
invention can be used to generate a variegated population of polypeptides for screening
and subsequent selection of variants. For example, a library of coding sequence fragments
can be generated by treating a double stranded PCR fragment of the coding sequence
of interest with a nuclease under conditions wherein nicking occurs only about once per
molecule, denaturing the double stranded DNA, renaturing the DNA to form double
stranded DNA which can include sense/antisense pairs from different nicked products,
removing single stranded portions from reformed duplexes by treatment with S1
nuclease, and ligating the resulting fragment library into an expression vector. By this
method, an expression library can be derived which encodes N-terminal and internal
fragments of various sizes of the protein of interest.

Several techniques are known in the art for screening gene products of combinatorial
libraries made by point mutations or truncation, and for screening cDNA libraries for
gene products having a selected property. The most widely used techniques, which are
amenable to high through-put analysis, for screening large gene libraries typically
include cloning the gene library into replicable expression vectors, transforming
appropriate cells with the resulting library of vectors, and expressing the combinatorial
genes under conditions in which detection of a desired activity facilitates isolation of the
vector encoding the gene whose product was detected. Recursive ensemble
mutagenesis (REM), a technique which enhances the frequency of functional mutants in
the libraries, can be used in combination with the screening assays to identify variants of
a protein of the invention (Arkin and Youvan (1992) Proc. Natl. Acad. Sci. USA 89:7811-
7815; Delagrave et al. (1993) Protein Engineering 6(3):327-331).

In addition to the hdfA and/or hdfB gene sequences shown in SEQ ID NO 2 and 5, it will
be apparent for the person skilled in the art that DNA sequence polymorphisms that may
lead to changes in the amino acid sequence of the hdfA and/or hdfB protein may exist
within a given population. Such genetic polymorphisms may exist in cells from different
populations or within a population due to natural allelic variation. Allelic variants may also
include functional equivalents.

Fragments of a polynucleotide according to the invention may also comprise
polynucleotides not encoding functional polypeptides. Such polynucleotides may
function as probes or primers for a PCR reaction.

Nucleic acids according to the invention, irrespective of whether they encode functional
or non-functional polypeptides, can be used as hybridization probes or polymerase chain
reaction (PCR) primers. Uses of the nucleic acid molecules of the present invention that
do not encode a polypeptide having an hdfA and/or hdfB activity include, inter alia, (1)
isolating the gene encoding the hdfA and/or hdfB protein, or allelic variants thereof from
a cDNA library e.g. from other organisms than A. niger; (2) in situ hybridization (e.g.
FISH) to metaphase chromosomal spreads to provide precise chromosomal location of
the hdfA and/or hdfB gene as described in Verma et al., Human Chromosomes: a
Manual of Basic Techniques, Pergamon Press, New York (1988); (3) Northern blot
analysis for detecting expression of hdfA and/or hdfB mRNA in specific tissues and/or
cells; and (4) probes and primers that can be used as a diagnostic tool to analyse the
presence of a nucleic acid hybridisable to the hdfA and/or hdfB probe in a given
biological (e.g. tissue) sample.

Also encompassed by the invention is a method of obtaining a functional equivalent of
an hdfA and/or hdfB gene or cDNA. Such a method entails obtaining a labelled probe
that includes an isolated nucleic acid which encodes all or a portion of the sequence
according to SEQ ID NO 2 or 5 or a variant thereof; screening a nucleic acid
fragment library with the labelled probe under conditions that allow hybridisation of the
probe to nucleic acid fragments in the library, thereby forming nucleic acid duplexes, and
preparing a full-length gene sequence from the nucleic acid fragments in any labelled
duplex to obtain a gene related to the hdfA and/or hdfB gene.

In one embodiment, an hdfA and/or hdfB nucleic acid of the invention is at least 45%,
50%, 55%, 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%,
97%, 98%, 99%, or more homologous to a nucleic acid sequence shown in SEQ ID NO 1,
or 2, or 4 or 5.

In another preferred embodiment an hdfA and/or hdfB polypeptide of the invention is at
least 60%, 65%, 70%, 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%,
98%, 99%, or more homologous to the amino acid sequence shown in SEQ ID NO 3 or 6.

The invention relates to DNA sequences having SEQ ID NO 1, or 2, or 4, or 5 per se
and to homologues thereof as defined above. DNA sequences related to these DNA
sequences and obtained by degeneration of the genetic code are also part of the
invention. DNA sequences related to DNA SEQ ID NO 2 and 5 and obtained by
hybridisation (see former paragraph) are also part of the invention. Isolated polypeptides
encoded by these DNA sequences or homologues thereof as defined above are also
part of the invention. The hdfA and hdfB polypeptides have a function
involved in NHR. All these polypeptides can be used in the method of the invention to
obtain filamentous fungi, which may have improved targeting efficiencies.

The invention further relates to the use of any of the filamentous fungi as obtained in
the present invention for producing the desired protein by culturing the transformed host
cell under conditions conducive to the expression of the DNA sequence encoding the
desired protein, and recovering the desired protein as described for example in the
following references:

- Li, Z. J., Shukla, V., Fordyce, A. P., Pedersen, A. G., Wenger, K. S., Marten, M.
R. Fungal morphology and fragmentation behavior in a fed-batch Aspergillus
oryzae fermentation at the production scale.
Biotechnol Bioeng. 2000 Nov 5;70(3):300-12.

- Withers, J. M., Swift, R. J., Wiebe, M. G., Robson, G. D., Punt, P. J., van den
Hondel, C. A. Optimization and stability of glucoamylase production by
recombinant strains of Aspergillus niger in chemostat culture.
Biotechnol Bioeng. 1998 Aug 20;59(4):407-18.

- Amanullah, A., Christensen, L. H., Hansen, K., Nienow, A. W., Thomas, R. C.
Dependence of morphology on agitation intensity in fed-batch cultures of
Aspergillus oryzae and its implications for recombinant protein production.
Biotechnol Bioeng. 2002 Mar 30;77(7):815-26.

The invention will be illustrated in more detail in the following examples. Such
examples are not intended to limit the scope of the invention.

Brief description of the drawings

Figure 1 depicts the replacement vector pDEL-HDFA used to inactivate the hdfA gene in
Aspergillus niger.

Figure 2 depicts the replacement vector pDEL-HDFB used to inactivate the hdfB gene in
Aspergillus niger.

Figure 3 depicts the strategy used to delete the hdfA gene of Aspergillus niger.

Figure 4 depicts the strategy used to delete the hdfB gene of Aspergillus niger.

Figure 5 depicts the schematic strategy used to integrate a DNA construct into the
genome of Aspergillus niger through single homologous recombination.

EXAMPLES 

Example 1: Identification of the hdfA and hdfB genes and construction of the
deletion vectors.

Genomic DNA of Aspergillus niger strain CBS513.88 was sequenced and analyzed. Two
genes with translated proteins annotated as homologues to Ku70 and Ku80 were
identified and named hdfA and hdfB respectively. Sequences of the hdfA and hdfB loci,
comprising the open reading frame (ORF) (with introns) and approximately 1000 bp 5'
and 3' of the genes, are shown in sequence listings 1 and 4. Gene replacement vectors
for hdfA and hdfB were designed according to known principles and constructed
according to routine cloning procedures (see figures 1 and 2). In essence, these vectors
comprise approximately 1000 bp flanking regions of the hdf ORFs for homologous
recombination at the predestined genomic loci. In addition, they contain the A. nidulans
bi-directional amdS selection marker, in-between direct repeats. The general design of
these deletion vectors was previously described in EP635574B and WO 98/46772.

Example 2: Inactivation of the hdfA gene in Aspergillus niger
Linear DNA of deletion vector pDEL-HDFA (figure 1) was isolated and used to transform
Aspergillus niger CBS513.88 using a method described earlier (Biotechnology of
Filamentous fungi: Technology and Products. (1992) Reed Publishing (USA); Chapter 6:
Transformation pages 113 to 156). This linear DNA can integrate into the genome at the
hdfA locus, thus substituting the hdfA gene by the amdS gene as depicted in figure 3.
Transformants were selected on acetamide media and colony purified according to
standard procedures as described in EP635574B. Spores were plated on
fluoro-acetamide media to select strains which had lost the amdS marker. Growing
colonies were diagnosed by PCR for integration at the hdfA locus and candidate strains
tested by Southern analyses for deletion of the hdfA gene. Deletion of the hdfA gene was
detectable by an approximately 2.2 kb size reduction of DNA fragments covering the
entire locus and hybridized to appropriate probes. Approximately 8 strains showed a
removal of the genomic hdfA gene from a pool of approximately 400 initial transformants.
Strain dHDFA was selected as a representative strain with the hdfA gene inactivated.

■ 

Example 3: Inactivation of the hdfB gene in Aspergillus niger 
Linear DNA of deletion vector pDEL-HDFB (figure 2) was isolated and used to transform 
the Aspergillus niger strain CBS513.88. This linear DNA can integrate into the genome 
at the hdfB locus, thus substituting the hdfB gene with amdS (figure 4). The same 
technique of gene replacement was used as the one described in example 2. 
Transformants were selected on acetamide media and colony purified according to 
standard procedures. Spores were plated on fluoro-acetamide media to select strains 
that had lost the amdS marker (EP 635574B). Growing colonies were diagnosed by PCR 
for integration at the hdfB locus, and candidate strains were tested by Southern analysis 
for deletion of the hdfB gene. Deletion of the hdfB gene was detectable as a size 
reduction of approximately 2.6 kb in DNA fragments covering the entire locus and 
hybridizing to appropriate probes. Approximately 7 strains showed a removal of the 
genomic hdfB gene from a pool of approximately 370 initial transformants. 

Strain dHDFB was selected as a representative strain with the hdfB gene inactivated. 

Example 4: Inactivation of the hdfA and hdfB genes in Aspergillus niger 
Linear DNA of deletion vector pDEL-HDFB (figure 2) was isolated and used to transform 
strain dHDFA obtained in Example 2. This linear DNA can integrate into the genome at 
the hdfB locus, thus substituting the hdfB gene with amdS (figure 4). The technique of 
gene replacement used is the one described in example 2. Transformants were selected 
on acetamide media and colony purified according to standard procedures. Spores were 
plated on fluoro-acetamide media to select strains that had lost the amdS marker. Growing 
colonies were diagnosed by PCR for integration at the hdfB locus, and candidate strains 
were tested by Southern analysis for deletion of the hdfB gene. Deletion of the hdfB gene 
was detectable as a size reduction of approximately 2.6 kb in DNA fragments covering the 
entire locus and hybridizing to appropriate probes. Approximately 15 strains showed a 
removal of the genomic hdfB gene from a pool of approximately 380 initial transformants. 
Strain dHDFAB was selected as a representative strain with both the hdfA and hdfB 
genes inactivated. 

Example 5: Improved targeting for single homologous recombination events. 

One mechanism by which DNA may integrate into the genome of Aspergillus niger at a 
predestined locus is through a single homologous recombination. Homologous DNA 
aligns and integrates at the genomic position by recombination (see figure 5). Two 
vectors were used to test the targeting efficiency through a single homologous 
recombination of the Aspergillus niger strains obtained in examples 2, 3, and 4. The two 
vectors comprise regions homologous to the glucoamylase (glaA) locus to direct 
recombination and the resulting integration (figure 5). 

The first vector designed for such homologous integration has already been described 
in WO 02/45524 (pGBFIN11-EPO). This vector contains the gene coding for 
the proline specific endoprotease. 

The second vector (pGBFIN11-PLA) contains the gene coding for phospholipase A1 
(PLA1) from A. oryzae. The gene encoding this enzyme has already been published 
(Watanabe I, et al., Biosci. Biotechnol. Biochem. (1999), Vol 63, no. 5, pages 820-826). 
This gene was cloned into pGBFIN11 using the same technique as described in 
WO 02/045524 for the cloning of the proline specific endoprotease gene in 
pGBFIN11-EPO. 

Strains CBS513.88, dHDFA, dHDFB and dHDFAB were transformed with either the 
pGBFIN11-EPO or the pGBFIN11-PLA plasmid according to the transformation technique 
described in example 2. The results obtained were the same with both plasmids. 
We found that 5%, 10%, 10% and 10% of the transformants, respectively, had the plasmid 
integrated at the target locus. Hence, we concluded that the inactivation of at least one 
hdf gene in Aspergillus niger leads to a significant increase of the targeting efficiency of 
these strains through a single homologous recombination event. 

Example 6: Improved targeting for double homologous recombination events. 
Another mechanism by which DNA may integrate into the genome of Aspergillus niger at 
a predestined locus is through double homologous recombination. This method was 
employed, for instance, to inactivate the hdfA and hdfB genes as described in 
examples 2 and 3, respectively. When strain CBS513.88 was transformed with deletion 
vectors pDEL-HDFA or pDEL-HDFB (see examples 2 and 3), approximately 2% of the 
resulting transformants displayed an integration of the DNA construct at the homologous 
locus. However, when the dHDFA strain was transformed with the pDEL-HDFB deletion 
construct, its integration at the hdfB locus increased to approximately 4% (see example 
4). Similar results were obtained for the reverse experiment, in which targeted integration 
at the hdfA locus (dHDFB strain transformed with pDEL-HDFA) increased to 
approximately 4%. Hence, we concluded that the inactivation of at least one hdf gene in 
Aspergillus niger leads to a significant increase of the targeting efficiency of these strains 
through double homologous recombination events. 
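
The percentages quoted above follow from the approximate strain counts reported in examples 2 to 4. The short sketch below recomputes them; the class and variable names are hypothetical, the counts are the approximate figures from those examples, and the counts for the reverse experiment (dHDFB transformed with pDEL-HDFA) are not given in the text and are therefore omitted.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class DeletionExperiment:
    host: str           # recipient strain
    vector: str         # deletion vector used
    deleted: int        # strains showing removal of the target gene (approximate)
    transformants: int  # initial transformants screened (approximate)

    @property
    def frequency_pct(self) -> float:
        """Targeted strains as a percentage of initial transformants."""
        return 100.0 * self.deleted / self.transformants


# Approximate counts as reported in examples 2, 3 and 4.
EXPERIMENTS = [
    DeletionExperiment("CBS513.88", "pDEL-HDFA", 8, 400),
    DeletionExperiment("CBS513.88", "pDEL-HDFB", 7, 370),
    DeletionExperiment("dHDFA", "pDEL-HDFB", 15, 380),
]

for exp in EXPERIMENTS:
    print(f"{exp.host:10s} {exp.vector:10s} {exp.frequency_pct:4.1f} %")

# Ratio of hdfB targeting in the dHDFA background versus the parent strain.
fold = EXPERIMENTS[2].frequency_pct / EXPERIMENTS[1].frequency_pct
print(f"fold increase at the hdfB locus: {fold:.1f}")
```

With these counts the parent strain gives roughly 2% for both vectors and the dHDFA strain roughly 4% at the hdfB locus, that is, about a two-fold increase. Because the underlying counts are stated only approximately, the computed values should be read with the same tolerance.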

Example 7: Size reduction of the homologous region. 

A technical drawback for the construction of integration vectors is the relatively long 
stretches of DNA required for homologous recombination. For example, approximately 
1 kb of the homologous 5' and 3' regions of the hdf genes was used to inactivate the hdf 
genes (see example 1). As previously mentioned, about 2% of these constructs 
displayed homologous recombination in the CBS513.88 strain (see examples 2 and 3). 
Both the pDEL-HDFA and pDEL-HDFB plasmids were modified such that the 5' and 3' 
flanking regions required for homologous recombination were trimmed to approximately 
250 bp by using nuclease enzymes (Molecular Cloning: A Laboratory Manual (3 Volume Set) 
T. Maniatis, Joseph Sambrook, E. F. Fritsch, December, 1989 Cold Spring Harbor Laboratory 
ISBN: 0879693096). The resulting plasmids pDEL-HDFA-2 and pDEL-HDFB-2 were 
separately transformed to strains dHDFA and dHDFB, respectively, using the transformation 
method described in example 2. Approximately 2% of the transformants showed 
targeted integration of the vectors at the respective loci. In contrast, targeted integration 
decreased to only 0.5% when strain CBS513.88 (not carrying a deletion of any of the 
hdf genes) was transformed with these vectors. Hence, we concluded that the 
inactivation of at least one hdf gene in Aspergillus niger allowed a significant reduction of 
the homologous regions of the plasmids. 

Similar results were obtained for truncation of the homologous regions of the glaA locus 
in plasmids pGBFIN11-EPO and pGBFIN11-PLA (as described and used in example 5), 
designed for single homologous recombination. 
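
For readers designing such vectors in silico, the effect of shortening the flanks from about 1000 bp to about 250 bp can be sketched as a simple slicing operation on a locus sequence. This is only an illustration: in the example above the flanks were trimmed enzymatically, the function name `homology_flanks` is hypothetical, and the toy locus is synthetic rather than the hdfA or hdfB sequence.

```python
from typing import Tuple


def homology_flanks(locus: str, orf_start: int, orf_end: int,
                    flank_len: int) -> Tuple[str, str]:
    """Return the 5' and 3' flanks of length flank_len adjacent to an ORF.

    Coordinates are 0-based and half-open: the ORF is locus[orf_start:orf_end].
    """
    if flank_len <= 0:
        raise ValueError("flank_len must be positive")
    if orf_start < 0 or orf_end > len(locus) or orf_start >= orf_end:
        raise ValueError("ORF coordinates out of range")
    if orf_start < flank_len or len(locus) - orf_end < flank_len:
        raise ValueError("locus too short for the requested flank length")
    five_prime = locus[orf_start - flank_len:orf_start]
    three_prime = locus[orf_end:orf_end + flank_len]
    return five_prime, three_prime


# Synthetic toy locus: 1000 bp upstream, a short ORF, 1000 bp downstream.
toy_orf = "ATG" + "C" * 30 + "TGA"
toy_locus = "A" * 1000 + toy_orf + "G" * 1000
orf_start, orf_end = 1000, 1000 + len(toy_orf)

long_5, long_3 = homology_flanks(toy_locus, orf_start, orf_end, 1000)
short_5, short_3 = homology_flanks(toy_locus, orf_start, orf_end, 250)
print(len(long_5), len(long_3), len(short_5), len(short_3))
```

The shortened flanks are the 250 bp immediately adjacent to the ORF on each side, which is the region that must still align with the genome for the double cross-over to replace the gene.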


Example 8: Phenotype analysis and production of polypeptide. 

No phenotypic differences were observed during growth of the dHDFA, dHDFB or 
dHDFAB strains on solid media or in shake flasks. Strains dHDFA, dHDFB and dHDFAB 
transformed with plasmids pGBFIN11-EPO or pGBFIN11-PLA all secreted active enzyme 
into the medium, as determined according to the following procedures. 

The solid medium was potato dextrose agar (PDA) (Difco, POTATO DEXTROSE 
AGAR, cultivation medium, catalogue no. 213400, year 1996-1997). 

Shake flask experiments were performed in 100 ml of the medium described in EP 
635 574 B at 34°C and 170 rpm in an incubator shaker using a 500 ml baffled shake 
flask. After four days of fermentation, samples were taken to determine either the proline 
specific endoprotease activity or the phospholipase activity. 

The proteolytic activity of the proline specific endoprotease was measured 
spectrophotometrically over time at pH 5 and about 37°C using Z-Gly(cine)- 
Pro(line)-pNA as a substrate. 1 U proline specific endoprotease is defined as the 
amount of enzyme which [illegible] at pH 5 and at 37°C. 

To determine phospholipase PLA1 activity from Aspergillus niger (PLA1) 
spectrophotometrically, an artificial substrate is used: 1,2-dithiodioctanoyl 
phosphatidylcholine (diC8, substrate). PLA1 hydrolyses the sulphide bond at the A1 
position, dissociating thio-octanoic acid. Thio-octanoic acid reacts with 
4,4-dithiodipyridine (colour reagent, 4-DTDP), forming 4-thiopyridone. 4-Thiopyridone is in 
tautomeric equilibrium with 4-mercaptopyridine, which absorbs radiation having a 
wavelength of 334 nm. The extinction change at that wavelength is measured. One 
unit is the amount of enzyme that liberates 1 nmol thio-octanoic acid from 
1,2-dithiodioctanoyl phosphatidylcholine per minute at 37°C and pH 4.0. 

The substrate solution is prepared by dissolving 1 g diC8 crystals per 66 ml ethanol and 
adding 264 ml acetate buffer. The acetate buffer comprises 0.1 M acetate buffer pH 3.85 
containing 0.2% Triton-X100. The colour reagent is an 11 mM 4,4-dithiodipyridine 
solution. It was prepared by weighing 5.0 mg 4,4-dithiodipyridine in a 2 ml Eppendorf 
sample cup and dissolving it in 1.00 ml ethanol. 1.00 ml of milli-Q water was added. 
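
The stated reagent concentrations can be cross-checked from the weighed amounts. The sketch below is illustrative and rests on two assumptions not stated in the text: that the colour reagent is 4,4'-dithiodipyridine with the molecular formula C10H8N2S2, and that the final volumes are the simple sums of the volumes mixed (volume contraction of ethanol and water is ignored). All names in the code are hypothetical.

```python
# Standard atomic masses (g/mol), rounded.
ATOMIC_MASS = {"C": 12.011, "H": 1.008, "N": 14.007, "S": 32.06}

# Assumed molecular formula of 4,4'-dithiodipyridine: C10H8N2S2.
DTDP_FORMULA = {"C": 10, "H": 8, "N": 2, "S": 2}


def molar_mass(formula: dict) -> float:
    """Molar mass in g/mol from an element -> count mapping."""
    return sum(ATOMIC_MASS[element] * count for element, count in formula.items())


def millimolar(mass_mg: float, volume_ml: float, molar_mass_g_mol: float) -> float:
    """Concentration in mM of mass_mg dissolved in volume_ml."""
    millimoles = mass_mg / molar_mass_g_mol   # mg / (g/mol) = mmol
    return millimoles / volume_ml * 1000.0    # mmol/ml = mol/l, times 1000 gives mM


dtdp_mm = molar_mass(DTDP_FORMULA)
# 5.0 mg dissolved in 1.00 ml ethanol plus 1.00 ml water (assumed 2.00 ml total).
colour_reagent_mM = millimolar(5.0, 1.00 + 1.00, dtdp_mm)

# 1 g diC8 in 66 ml ethanol plus 264 ml acetate buffer (assumed 330 ml total).
substrate_g_per_l = 1.0 / (66 + 264) * 1000.0

print(f"molar mass of 4-DTDP: {dtdp_mm:.2f} g/mol")
print(f"colour reagent: {colour_reagent_mM:.1f} mM")
print(f"substrate solution: {substrate_g_per_l:.2f} g/l diC8")
```

Under these assumptions the colour reagent comes out at about 11.3 mM, consistent with the nominal 11 mM, and the substrate solution contains about 3 g diC8 per litre.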

Interestingly, morphological changes such as differences in colour or colony appearance 
occurred less frequently for transformants obtained from the dHDFA, dHDFB and 
dHDFAB strains than for transformants obtained from CBS513.88. This could be due to 
a reduction of random integrations (NHR), thus preventing unexpected phenotypic 
changes. 

Example 9: Isolation of Penicillium mutants with improved efficiency for 
homologous recombination by mutagenesis 
To isolate mutants with an improved efficiency of gene targeting, a combination of 
classical mutagenesis and molecular biology was applied. Penicillium chrysogenum 
(CBS 455.95) spores were obtained from colonies sporulating in YEPD (2% yeast 
extract from Difco, 1% peptone from Difco, 2% glucose). These spores were washed in 
sterile tap water and 10 ml of a suspension containing 10^6 conidiospores per ml was 
subjected to UV irradiation at 254 nm (Sylvania, 15 Watts Black Light Blue tube, model 
FT15T8/BLB). UV irradiation was applied for 7.5, 15 or 30 minutes while the 
suspensions were slowly shaken. These different irradiation times were chosen to obtain 
mild, medium and strong mutation rate levels in the cells. After one hour of recovery in 
the dark, the cells from these three time points were divided into two equal aliquots. The 
first sample was directly re-sporulated as described earlier (Hersbach, GJM, Van der 
Beek, CP and Van Dijck, PWM. The Penicillins: properties, biosynthesis and 
fermentation. In: Vandamme EJ (ed) Biotechnology of Industrial Antibiotics (pp 45-140). 
Marcel Dekker, New York) and the other sample was incubated for an extended 
recovery period in YNB medium (0.67% w/v Yeast Nitrogen Base with amino acids 
(Difco), 2.0% w/v glucose) for 4 hours at 25°C before sporulation was induced. 
A third set of mutagenized samples was obtained by germinating wild type spores 
overnight in YNB, followed by two washing steps in sterile tap water and resuspension in 
sterile tap water. Again, UV irradiation was applied for 7.5, 15 and 30 minutes while the 
suspensions were slowly shaken. These samples were directly re-sporulated (as 
described above) after one hour of recovery in the dark. 
To select the wanted mutants from these mutagenized populations, the 
mutagenized populations were inoculated in YEPD medium. After germination the 
development of cells was followed using standard light microscopy. When the 
hyphae of a culture were, on average, well developed, cells were harvested and incubated 
with lysing enzymes to obtain protoplasts. Protoplasts were transformed with two DNA 
fragments carrying expression cassettes of functional selection markers, ble and amdS. 
The gene ble encodes a protein capable of conferring resistance to phleomycin (Kolar 
M, Punt PJ, van den Hondel CA, Schwab H. Transformation of Penicillium chrysogenum 
using dominant selection markers and expression of an Escherichia coli lacZ fusion 
gene. Gene. 1988;62(1):127-34). The gene amdS encodes a protein enabling cells to 
grow on acetamide as the sole nitrogen source (as described in EP635574B). 
Techniques applied for the transfer of DNA to protoplasts of P. chrysogenum are 
well known in the art and are described in many references, including Finkelstein and 
Ball (eds.), Biotechnology of filamentous fungi, technology and products, Butterworth- 
Heinemann (1992); Bennett and Lasure (eds.), More Gene Manipulations in fungi, 
Academic Press (1991); Turner, in: Pühler (ed), Biotechnology, second completely 
revised edition, VCH (1992). The Ca-PEG mediated protoplast transformation is used as 
described in EP635574. 

To select for targeted integration of these expression cassettes at specific loci in the 
Penicillium genome, short homologous stretches of DNA were added via PCR on both 
sides of the DNA fragments. Three types of construct were made: the first type contains 
homologous stretches of DNA of 30 bp, the second of 50 bp and the third of 100 bp. 

Selection was performed by transforming mutants obtained from the nine sporulated 
batches with two DNA constructs (ble and amdS) carrying 30, 50 or 100 bp homologous 
stretches, defining 27 distinct batches. The ble gene was targeted to the niaD locus, 
thereby disrupting the nitrate reductase gene (Gouka RJ, van Hartingsveldt W, 
Bovenberg RA, van den Hondel CA, van Gorcom RF. Cloning of the nitrate-nitrite 
reductase gene cluster of Penicillium chrysogenum and use of the niaD gene as a 
homologous selection marker. J Biotechnol. 1991 Sep;20(2):189-99), enabling direct 
selection of transformants on plates containing chlorate, as cells become resistant to 
chlorate. The amdS gene was targeted to the sutB locus, thereby disrupting the sulphate 
permease gene (van de Kamp M, Pizzinini E, Vos A, van der Lende TR, Schuurs TA, 
Newbert RW, Turner G, Konings WN, Driessen AJ. Sulfate transport in Penicillium 
chrysogenum: cloning and characterization of the sutA and sutB genes. J. Bacteriol. 
1999 Dec;181(23):7228-34), enabling direct selection of transformants on plates 
containing selenate. Transformants were first selected on chlorate and then tested on 
selenate. Furthermore, the presence of the selection markers was demonstrated by 
growth on plates containing acetamide as the sole nitrogen source (EP635574B) and 
subsequently on plates containing phleomycin. As a control, wild type P. chrysogenum 
CBS 455.95 was also transformed with the same DNA fragments. Mutants with both 
selection markers present and resistant against both chlorate and selenate are strains 
with improved targeted integration. 
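
The count of 27 batches follows from the experimental design described above: three mutagenesis regimes, each at three UV exposure times, give nine sporulated batches, and each of these is transformed with constructs of three homology lengths. The enumeration below is a small sketch of that bookkeeping; the labels and variable names are hypothetical shorthand for the regimes described in the text.

```python
from itertools import product

# Three mutagenesis regimes described in the text (labels are shorthand).
TREATMENTS = (
    "spores, re-sporulated after 1 h recovery",
    "spores, extended 4 h recovery in YNB",
    "germinated spores, re-sporulated after 1 h recovery",
)
UV_MINUTES = (7.5, 15, 30)   # mild, medium and strong mutagenesis
HOMOLOGY_BP = (30, 50, 100)  # homologous stretch on each side of ble and amdS

# Nine sporulated batches: every regime at every UV exposure time.
sporulated_batches = list(product(TREATMENTS, UV_MINUTES))

# Each sporulated batch is transformed with each homology length.
transformation_batches = list(product(sporulated_batches, HOMOLOGY_BP))

print(len(sporulated_batches))      # number of sporulated batches
print(len(transformation_batches))  # number of distinct transformation batches

for (treatment, minutes), homology in transformation_batches[:3]:
    print(f"{treatment}; UV {minutes} min; {homology} bp flanks")
```

Each of the resulting batches receives both the ble and the amdS construct, so the construct identity does not multiply the batch count further.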
02.04.2004 

CLAIMS 




1. Method for increasing the efficiency of targeted integration of a polynucleotide to 
a pre-determined site into the genome of a filamentous fungal cell with a 
preference for NHR, wherein said polynucleotide has a region of homology with 
said pre-determined site, comprising steering an integration pathway towards 
HR. 

2. The method of claim 1, wherein the steering comprises providing a mutant of a 
parent filamentous fungal cell, wherein the ratio of NHR/HR is decreased in the 
mutant as compared to said ratio in said parent organism measured under the 
same conditions. 

3. The method of claim 1 or 2, wherein the steering comprises providing a mutant 
which is deficient in a gene encoding a component involved in NHR, and/or has a 
decreased level of a component involved in NHR. 

4. The method of claim 3, wherein the mutant is, preferably inducibly, deficient in at 
least one of the following genes: hdfA or homologues thereof, hdfB or 
homologues thereof, or both, and/or has, preferably inducibly, a decreased 
amount of at least one of the proteins encoded by these genes. 

5. The method of claim 3 or 4, wherein a gene involved in NHR has been replaced 
by a non-functional variant. 

6. The method according to any one of claims 1 to 5, wherein the steering 
comprises adding an excess of small double stranded polynucleotides to the 
polynucleotide to be integrated. 

7. The method according to any one of claims 1 to 6, wherein the steering 
comprises decreasing the activity of at least one protein active in the NHR by 
adding an inhibitor of said protein(s). 

8. The method according to any one of claims 1 to 7, wherein the mutant has an 
increased level of a component involved in HR. 

9. A mutant of a parent filamentous fungal cell, the parent organism having a 
preference for NHR, wherein the ratio of NHR/HR is decreased in the mutant as 
compared to said ratio in said parent organism measured under the same 
conditions. 

10. The mutant according to claim 9, wherein the mutant is deficient in a gene 
encoding a component involved in NHR, and/or has a decreased level of a 
component involved in NHR. 

11. The mutant according to claim 10, wherein the mutant is, preferably inducibly, 
deficient in at least one of the following genes: hdfA or homologues thereof, hdfB 
or homologues thereof, or both, and/or has, preferably inducibly, a decreased 
amount of at least one of the proteins encoded by these genes. 

12. The mutant according to any one of claims 9 to 11, wherein in the genome of the 
organism a gene involved in NHR has been replaced by a non-functional variant. 

13. The mutant according to any one of claims 9 to 12, wherein the mutant has an 
increased level of a component involved in HR. 

14. The mutant according to any one of claims 9 to 13, wherein the mutant is a 
recombinant mutant. 

15. The mutant according to any one of claims 9 to 14 transformed with a DNA 
construct comprising a DNA sequence comprising a gene of interest encoding a 
polypeptide of interest. 

16. The mutant according to any one of claims 9 to 15, wherein the filamentous 
fungus is an Aspergillus, Penicillium or Trichoderma species. 

17. The mutant according to claim 16, wherein the Aspergillus is an Aspergillus niger 
or an Aspergillus oryzae species. 

18. The mutant according to claim 16, wherein the Penicillium is a Penicillium 
chrysogenum or Penicillium citrinum species. 

19. Method for producing a polypeptide of interest, wherein the mutant of any one of 
claims 15 to 18 is used. 

20. Method for producing a metabolite, wherein the mutant of any one of claims 15 to 
18 is used. 

21. Method according to claim 20, wherein the metabolite is a carotenoid compound 
or a beta-lactam compound. 

22. Isolated DNA sequences having SEQ ID NO 2 or 5 or homologues thereof. 

23. Isolated polypeptides encoded by the DNA sequences of claim 22 or homologues 
thereof. 



DSM IP Assets B.V. 24181EP/P0/ 

Filamentous fungal mutants with improved homologous 
recombination efficiency 

ABSTRACT 

The present invention relates to a method for increasing the efficiency of targeted 
integration of a polynucleotide to a pre-determined site into the genome of a filamentous 
fungal cell with a preference for NHR, wherein said polynucleotide has a region of 
homology with said pre-determined site, comprising steering an integration pathway 
towards HR. The present invention also relates to a mutant filamentous fungus 
originating from a parent cell, said mutant having an HR pathway with elevated efficiency 
and/or an NHR pathway with a lowered efficiency and/or a decreased NHR/HR ratio 
as compared to said HR and/or NHR efficiency and/or NHR/HR ratio of said 
parent cell under the same conditions. 
Figure 1: Plasmid map of replacement vector pDEL-HDFA. Indicated are the hdfA flanking 
regions relative to the amdS marker. The E. coli DNA was removed by digestion with restriction 
enzymes AscI and NotI prior to transformation of the A. niger strains. 

Figure 2: Plasmid map of replacement vector pDEL-HDFB. Indicated are the hdfB flanking 
regions relative to the amdS marker. The E. coli DNA was removed by digestion with restriction 
enzymes AscI and NotI prior to transformation of the A. niger strains. 

Gene deletion of the hdfA gene 

Figure 3: Schematic presentation of the hdfA deletion. A DNA construct comprising the 
amdS selection marker flanked by homologous regions (5' and 3') of the hdfA gene (1) 
integrates through double homologous recombination (X) at the genomic hdfA locus (2) and 
replaces the genomic hdfA gene copy (3). Subsequently, recombination over the direct repeats 
(U) removes the amdS marker, resulting in precise excision of the hdfA gene (4). 

Gene deletion of the hdfB gene 

Figure 4: Schematic presentation of the hdfB deletion. A DNA construct comprising the 
amdS selection marker flanked by homologous regions (5' and 3') of the hdfB gene (1) 
integrates through double homologous recombination (X) at the genomic hdfB locus (2) and 
replaces the genomic hdfB gene copy (3). Subsequently, recombination over the direct repeats 
(U) removes the amdS marker, resulting in precise excision of the hdfB gene (4). 

Figure 5: Schematic presentation of integration through single homologous 
recombination. The expression vector comprises the selectable amdS marker and gene 
of interest. These are flanked by homologous regions of the glaA locus (3' glaA and 3" 
glaA respectively) to direct integration at the genomic locus. 

SEQUENCE LISTING 
<110> DSM IP Assets B.V. 

<120> FILAMENTOUS FUNGAL MUTANTS WITH IMPROVED HOMOLOGOUS RECOMBINATION 
EFFICIENCY 

<130> 24181EP/P0/ 

<160> 8 

<170> PatentIn version 3.1 

<210> 1 
<211> 2284 
<212> DNA 
<213> Aspergillus niger 

<400> 1 

atggcggacg gcaacccaca tcgggaagat gaggcggccg aggaagaaga ggagattgat 60 
gagactgtac gcaaatttac ccatgaactt ggactggaac tctggaactg acaataagat 120 
cagagctaca aaccagtcaa agatgcggtc ctcttcgcaa tcgatgtcag cgattccatg 180 
ttgacgcctc gcccctcagc agatcctaag aaacacaccc aagaatcacc caccacggca 240 
gcgctcaaat gcgcctatca cttcatgcaa caacgaatca tatcaaatcc acaagacatg 300 
atgggtgttt tgctgttcgg gacccaggcg tccaagttct ttgaagaaga tgaagacagt 360 
cggggagacc tgtcctaccc caactgctac ctcttcactg atctggatgt tccttcggct 420 
catgaggtca aaggacttcg agcactggta gatgatgaag gagactcaag ggaggttcta 480 
tctccagcga aagagcaggt ctctatggca aacgtcctat tttgcgccaa ccagatattc 540 
acatccagag cgccaaattt cctctcccgg cgtttgttca tcataaccga caatgacaac 600 
ccccatggtg atgataaaac cctgcggtca gcggcgactg tacgtgctaa ggatctttac 660 
gatcttggtg tcacaattga gctgtttccg atctcacgcc ctgagcatga gttcaagaac 720 
agcaagttct atgacgtaag ctatcatact ctatagcaaa gtggcagggg tcgatactca 780 
ctacagatac aaaggatatt atctacaagt cattgcccag cgatccagag gcgcctgcat 840 
atctacaatc tgattcaaaa gcggcgactg cgaccgggga cgggatttca ctcctcaaca 900 
cgcttctgtc cagtattaat tcgagaacgg ttccgcgtcg cactcatttt tcgaacatgc 960 
ctttagaact tggcccagac ttcagaattt cggtatcggg ctatatactc ttacgaaggc 1020 
aagcgcccgc tagaaactcc ttcatctggc tgaacggcga gaagcctgtg gtcgcgaaag 1080 
gagtgacttc ccactccgca gatgatactg gccggactgt cgagaaatgg gagatcagaa 1140 
aggcatataa gttcggtggc gaccaagtaa ccttttcgcc tgatgagcag aaggcgctta 1200 

gggatttcgg tgagccagta atccgggtta ttgggttcaa gcctatcact gcgcttccat 1260 

tctgggcaaa cgtcaagcac ccatatttta tctatccatc cgaggaagac tatgtaggct 1320 

cctcgcgagt attttccgca ttgcatcaga ctcttttgcg ttccaagaag atggcactcg 1380 

tctggttcat tgcacgcaag ggtgctggcc ccgttctcgc cgctatgatc gcaggcgaag 1440 

aaaagcttga tgagaatggc gtacaaaaat accctcctgg catgtggatt cttcccctcc 1500 

ccttcgcaga cgatatccgg cagaaccccg aaacaacgtt gaatgtcgcc ccggagtcat 1560 

tgattgatca gatgcgcgtg atcgtccagc aactgcagct gccgaaggga gtgtacgagc 1620 

ctctcaaata ccccaatcca tgtaagtcac ttctgtcttg cattgctcgt atacgatgaa 1680 

cgagaagctg acagcccgtg atcagccctt caatggcatt accgcatcct acaagctctc 1740 

gcattagacg aagatctccc cgaaaaacca gaagacaaaa ccattccgaa ataccgccaa 1800 

atcgacaagg taaatccacc acacccaaca cgagaaataa ccctccaggc gtccaactta 1860 

ctgacaattg caccacagcg cgccggtgac tacgtattat cctgggccga cgaactcgaa 1920 

aagcaatacg ccaaaacctc agcagcggcc cctcgcccaa ccagcaccct cgtgaaacga 1980 

ggatcaaaag accgagcaag cgaaaccgag gactccaagc catcgaaaaa gatcaaggtt 2040 

gaggaagact ctggaagcct agaggaggaa gtccgcaggc atcacaagaa gggaacgcta 2100 

tccaaggtaa gccaccacag gctttctaca cgtcctcgtg atggcaaata tgacatcgta 2160 

ttaaccggcg gttttctagc ttacggtcgc tatcctcaag gacttcttga cttccaatgg 2220 

acgctcaaat gccggtaaga aggcggatct tattgagcgg gtagaggagt tcttggagca 2280 
gtga 2284 

<210> 2 
<211> 1947 
<212> DNA 

<213> Aspergillus niger 
<400> 2 

atggcggacg gcaacccaca tcgggaagat gaggcggccg aggaagaaga ggagattgat 60 

gagactagct acaaaccagt caaagatgcg gtcctcttcg caatcgatgt cagcgattcc 120 

atgttgacgc ctcgcccctc agcagatcct aagaaacaca cccaagaatc acccaccacg 180 

gcagcgctca aatgcgccta tcacttcatg caacaacgaa tcatatcaaa tccacaagac 240 

atgatgggtg ttttgctgtt cgggacccag gcgtccaagt tctttgaaga agatgaagac 300 

agtcggggag acctgtccta ccccaactgc tacctcttca ctgatctgga tgttccttcg 360 

gctcatgagg tcaaaggact tcgagcactg gtagatgatg aaggagactc aagggaggtt 420 

ctatctccag cgaaagagca ggtctctatg gcaaacgtcc tattttgcgc caaccagata 480 

ttcacatcca gagcgccaaa tttcctctcc cggcgtttgt tcatcataac cgacaatgac 540 

aacccccatg gtgatgataa aaccctgcgg tcagcggcga ctgtacgtgc taaggatctt 600 

tacgatcttg gtgtcacaat tgagctgttt ccgatctcac gccctgagca tgagttcaag 660 

aacagcaagt tctatgactc attgcccagc gatccagagg cgcctgcata tctacaatct 720 

gattcaaaag cggcgactgc gaccggggac gggatttcac tcctcaacac gcttctgtcc 780 

agtattaatt cgagaacggt tccgcgtcgc actcattttt cgaacatgcc tttagaactt 840 

ggcccagact tcagaatttc ggtatcgggc tatatactct tacgaaggca agcgcccgct 900 

agaaactcct tcatctggct gaacggcgag aagcctgtgg tcgcgaaagg agtgacttcc 960 

cactccgcag atgatactgg ccggactgtc gagaaatggg agatcagaaa ggcatataag 1020 

ttcggtggcg accaagtaac cttttcgcct gatgagcaga aggcgcttag ggatttcggt 1080 

gagccagtaa tccgggttat tgggttcaag cctatcactg cgcttccatt ctgggcaaac 1140 

gtcaagcacc catattttat ctatccatcc gaggaagact atgtaggctc ctcgcgagta 1200 

ttttccgcat tgcatcagac tcttttgcgt tccaagaaga tggcactcgt ctggttcatt 1260 

gcacgcaagg gtgctggccc cgttctcgcc gctatgatcg caggcgaaga aaagcttgat 1320 

gagaatggcg tacaaaaata ccctcctggc atgtggattc ttcccctccc cttcgcagac 1380 

gatatccggc agaaccccga aacaacgttg aatgtcgccc cggagtcatt gattgatcag 1440 

atgcgcgtga tcgtccagca actgcagctg ccgaagggag tgtacgagcc tctcaaatac 1500 

cccaatccat cccttcaatg gcattaccgc atcctacaag ctctcgcatt agacgaagat 1560 

ctccccgaaa aaccagaaga caaaaccatt ccgaaatacc gccaaatcga caagcgcgcc 1620 

ggtgactacg tattatcctg ggccgacgaa ctcgaaaagc aatacgccaa aacctcagca 1680 

gcggcccctc gcccaaccag caccctcgtg aaacgaggat caaaagaccg agcaagcgaa 1740 

accgaggact ccaagccatc gaaaaagatc aaggttgagg aagactctgg aagcctagag 1800 

gaggaagtcc gcaggcatca caagaaggga acgctatcca agcttacggt cgctatcctc 1860 

aaggacttct tgacttccaa tggacgctca aatgccggta agaaggcgga tcttattgag 1920 

cgggtagagg agttcttgga gcagtga 1947 






<210> 3 

<211> 648 
<212> PRT 

<213> Aspergillus niger 
<400> 3 

Met Ala Asp Gly Asn Pro His Arg Glu Asp Glu Ala Ala Glu Glu Glu 
1               5                   10                  15 

Glu Glu Ile Asp Glu Thr Ser Tyr Lys Pro Val Lys Asp Ala Val Leu 
            20                  25                  30 

Phe Ala Ile Asp Val Ser Asp Ser Met Leu Thr Pro Arg Pro Ser Ala 
        35                  40                  45 

Asp Pro Lys Lys His Thr Gln Glu Ser Pro Thr Thr Ala Ala Leu Lys 
    50                  55                  60 

Cys Ala Tyr His Phe Met Gln Gln Arg Ile Ile Ser Asn Pro Gln Asp 
65                  70                  75                  80 

Met Met Gly Val Leu Leu Phe Gly Thr Gln Ala Ser Lys Phe Phe Glu 
                85                  90                  95 

Glu Asp Glu Asp Ser Arg Gly Asp Leu Ser Tyr Pro Asn Cys Tyr Leu 
            100                 105                 110 

Phe Thr Asp Leu Asp Val Pro Ser Ala His Glu Val Lys Gly Leu Arg 
        115                 120                 125 

Ala Leu Val Asp Asp Glu Gly Asp Ser Arg Glu Val Leu Ser Pro Ala 
    130                 135                 140 

Lys Glu Gln Val Ser Met Ala Asn Val Leu Phe Cys Ala Asn Gln Ile 
145                 150                 155                 160 

Phe Thr Ser Arg Ala Pro Asn Phe Leu Ser Arg Arg Leu Phe Ile Ile 
                165                 170                 175 

Thr Asp Asn Asp Asn Pro His Gly Asp Asp Lys Thr Leu Arg Ser Ala 
            180                 185                 190 

Ala Thr Val Arg Ala Lys Asp Leu Tyr Asp Leu Gly Val Thr Ile Glu 
        195                 200                 205 

Leu Phe Pro Ile Ser Arg Pro Glu His Glu Phe Lys Asn Ser Lys Phe 
    210                 215                 220 

Tyr Asp Ser Leu Pro Ser Asp Pro Glu Ala Pro Ala Tyr Leu Gln Ser 
225                 230                 235                 240 

Asp Ser Lys Ala Ala Thr Ala Thr Gly Asp Gly Ile Ser Leu Leu Asn 
                245                 250                 255 

Thr Leu Leu Ser Ser Ile Asn Ser Arg Thr Val Pro Arg Arg Thr His 
            260                 265                 270 

Phe Ser Asn Met Pro Leu Glu Leu Gly Pro Asp Phe Arg Ile Ser Val 
        275                 280                 285 

Ser Gly Tyr Ile Leu Leu Arg Arg Gln Ala Pro Ala Arg Asn Ser Phe 
    290                 295                 300 

Ile Trp Leu Asn Gly Glu Lys Pro Val Val Ala Lys Gly Val Thr Ser 
305                 310                 315                 320 

His Ser Ala Asp Asp Thr Gly Arg Thr Val Glu Lys Trp Glu Ile Arg 
                325                 330                 335 

Lys Ala Tyr Lys Phe Gly Gly Asp Gln Val Thr Phe Ser Pro Asp Glu 
            340                 345                 350 

Gln Lys Ala Leu Arg Asp Phe Gly Glu Pro Val Ile Arg Val Ile Gly 
        355                 360                 365 

Phe Lys Pro Ile Thr Ala Leu Pro Phe Trp Ala Asn Val Lys His Pro 
    370                 375                 380 

Tyr Phe Ile Tyr Pro Ser Glu Glu Asp Tyr Val Gly Ser Ser Arg Val 
385                 390                 395                 400 

Phe Ser Ala Leu His Gln Thr Leu Leu Arg Ser Lys Lys Met Ala Leu 
                405                 410                 415 

Val Trp Phe Ile Ala Arg Lys Gly Ala Gly Pro Val Leu Ala Ala Met 
            420                 425                 430 

Ile Ala Gly Glu Glu Lys Leu Asp Glu Asn Gly Val Gln Lys Tyr Pro 
        435                 440                 445 

Pro Gly Met Trp Ile Leu Pro Leu Pro Phe Ala Asp Asp Ile Arg Gln 
    450                 455                 460 

Asn Pro Glu Thr Thr Leu Asn Val Ala Pro Glu Ser Leu Ile Asp Gln 
465                 470                 475                 480 

Met Arg Val Ile Val Gln Gln Leu Gln Leu Pro Lys Gly Val Tyr Glu 
                485                 490                 495 

Pro Leu Lys Tyr Pro Asn Pro Ser Leu Gln Trp His Tyr Arg Ile Leu 
            500                 505                 510 

Gln Ala Leu Ala Leu Asp Glu Asp Leu Pro Glu Lys Pro Glu Asp Lys 
        515                 520                 525 

Thr Ile Pro Lys Tyr Arg Gln Ile Asp Lys Arg Ala Gly Asp Tyr Val 
    530                 535                 540 

Leu Ser Trp Ala Asp Glu Leu Glu Lys Gln Tyr Ala Lys Thr Ser Ala 
545                 550                 555                 560 

Ala Ala Pro Arg Pro Thr Ser Thr Leu Val Lys Arg Gly Ser Lys Asp 
                565                 570                 575 

Arg Ala Ser Glu Thr Glu Asp Ser Lys Pro Ser Lys Lys Ile Lys Val 
            580                 585                 590 

Glu Glu Asp Ser Gly Ser Leu Glu Glu Glu Val Arg Arg His His Lys 
        595                 600                 605 

Lys Gly Thr Leu Ser Lys Leu Thr Val Ala Ile Leu Lys Asp Phe Leu 
    610                 615                 620 

Thr Ser Asn Gly Arg Ser Asn Ala Gly Lys Lys Ala Asp Leu Ile Glu 
625                 630                 635                 640 

Arg Val Glu Glu Phe Leu Glu Gln 
                645 




<210> 4 
<211> 2651 
<212> DNA 

<213> Aspergillus niger 

<400> 4

atggccgata aagaggcaac tgtctacatc gtggactgcg gcaagtccat gggggagcgg 60 

cgtcatggtc gcgaagtgac ggatctcgac tgggcgatgc aatatgtttg ggatcgtatt 120 

acagggacgg tgagatcctt attcttgaga atcatatcat acatgaaagc ttatgttttg 180 

gataggtggc cactggacga aaaatggctt tgatcggtgt tcttgggctc aggacagatg 240

gtgagtgact agcctcccgg gtacagttgg tagttgtagt ttgctggtcg gggctaatgc 300 

aggaacgtcc agaaaccgct aatgagttgg aggatgatcc tgattattcg catatctcgg 360 

ttttgtctgg gattaaacag tatgattcat ttttgtctgc tgatcctctg gttattcgct 420 

gatgaactat aggtttctta tgccggatat ccggggtttg agcgaccgaa taaagcctag 480 

caagactaat aagggagatg gtgagttact cttcttgtat ggaattggag tgattggggc 540

tgagccgatg aatatagcta tctctgcact tgtgctcgcg attcagatga ttatcactca 600

gtgcaagaaa ctgaagtaca agcgcaggat tgtcctggtt actaatgggc agggcccgat 660

gaacccggat aatcttagtg aaataacgaa gaagattaag gaggataaca ttgaacttat 720

tattctgtta gtgtcaattg atacactgag agaaccgggg tactaacatg ctgcagggga 780

ccagactttg atgatcctga atatggggtg aaagaggaag ataaagatcc gcgaaaggta 840

tttaacttcg ttccatatgc tctagactaa taataacaat ggctacaggc cgaaaatgaa 900

acactcctgc gtagtcttgc cgaagactgc gaaggagcct atggaaccct agaacaagct 960

gttgcggagc tggaaactcc tcgtgtgaaa accacaagga taacagcaag cttcaagggc 1020

catttgcaac taggaaaccc cgcagaatat gatactgcag ttcggatccc tgtggagcgc 1080

tactacagga catacgttgc aaaagctccg tcggctagtc agttcacagt acgtaacgaa 1140

gaggagatgg gaatggccgc ggccgcagcc ggctcgcagg aaggtagttc ccttgtgggt 1200

gttcgaaaca acaggtccta ccaaattgac gatgggacta ctgaagaagg ggtgagggac 1260

gtggatcgag agcaacttgc caagggttat gagtacgggc ggacattggt ccctattagc 1320

gagacggatg agaatatcac caccctagag acatttgcgg ctatcgagct tcttgggttt 1380

atacagagcg atcgggtgag ttctaccctc caataactgt tattatgctg ctaagtgggt 1440

tttgccatta gtatgatcga tacatgcaca tgtcgacgac aaacatcatc atcgcgcagc 1500

gcgcgaatga caaggcagca ctcgctcttt cctctttcat acatgcgctg ttcgagctgg 1560






aatcgtacgc tgtcgcccgt atggtgctaa aggagaacaa accccctgtc atagtcgtgc 1620

ttgcgccatc aatcgaaccc gactacgagt gtctcctcga agcgcagttg ccattcgcag 1680

aagacgtacg aacgtaccgc ttccctccac tcgacagagt cattacagtg tctggtaaag 1740

tggtgacaca gcatcgaaac ctacccaacg acgatctgtt gaatgcgatg gacaaatacg 1800

tgaaaagcat ggagcttacc gatatggacg agaacgggtg agaagaattg gaagtgatct 1860

caacttcact gctgactttg tacaaagtga cccgacggaa tctctcccaa tagacgactc 1920

tttctctcca gtcctgcacc ggatcgactc cgcaatccgt caccgtgcca ttcatcccaa 1980

cgaccctatc ccgcccccag cctcagtcct aacgaagttc tcccaccctc cggatgacct 2040

cgtcgagaag tccaagaaat acctagacaa gctagtagca gtgtcggacg tcaagaaagg 2100

tcagtccatc tcggccttga gcctcttagg cccccatcat actcacagtg atgaatctag 2160

tcccaccaaa aaccaaaggc accaaacgga cccgcgaaac cgagaagcca ctatccggtc 2220

tcgacgtcga tgcccttctc caccaagaga agcgcacgaa gatctcaccc aacaacgcaa 2280

ttcccgagtt taagcagacg ctctcgcagg cagagaacat cgagatcatc aaggatgcag 2340

tgaagcagat gagcactatc attgaagacc aaatcaggca tagtcttggc gatgttaatt 2400

atcatcgggt cactgagggg ctaggtgtga tgcgggagga actgatcgat tatgaggaac 2460

ctgctctgta taacgatttc ttgaagcagc tgaaggagaa gttgttgaaa gaggagctcg 2520

gtggggatcg acgggagctg tggtggctgc taagaaggag taagttgggg ttgattgaac 2580

agagggagtc ggaacactct gaggtgagag aagaggaagc gaaggcgttt atgtctatgg 2640

ctgctaagtg a 2651



<210> 5 
<211> 2178 
<212> DNA 

<213> Aspergillus niger 
<400> 5 

atggccgata aagaggcaac tgtctacatc gtggactgcg gcaagtccat gggggagcgg 60
cgtcatggtc gcgaagtgac ggatctcgac tgggcgatgc aatatgtttg ggatcgtatt 120
acagggacgg tggccactgg acgaaaaatg gctttgatcg gtgttcttgg gctcaggaca 180
gatgtttgct ggtcggggct aatgcaggaa cgtccagaaa ccgctaatga gttggaggat 240
gatcctgatt attcgcatat ctcggttttg tctgggatta aacagtttct tatgccggat 300
atccggggtt tgagcgaccg aataaagcct agcaagacta ataagggaga tgctatctct 360



gcacttgtgc tcgcgattca gatgattatc actcagtgca agaaactgaa gtacaagcgc 420 

aggattgtcc tggttactaa tgggcagggc ccgatgaacc cggataatct tagtgaaata 480 

acgaagaaga ttaaggagga taacattgaa cttattattc tgggaccaga ctttgatgat 540

cctgaatatg gggtgaaaga ggaagataaa gatccgcgaa aggccgaaaa tgaaacactc 600

ctgcgtagtc ttgccgaaga ctgcgaagga gcctatggaa ccctagaaca agctgttgcg 660 

gagctggaaa ctcctcgtgt gaaaaccaca aggataacag caagcttcaa gggccatttg 720 

caactaggaa accccgcaga atatgatact gcagttcgga tccctgtgga gcgctactac 780 

aggacatacg ttgcaaaagc tccgtcggct agtcagttca cagtacgtaa cgaagaggag 840

atgggaatgg ccgcggccgc agccggctcg caggaaggta gttcccttgt gggtgttcga 900 

aacaacaggt cctaccaaat tgacgatggg actactgaag aaggggtgag ggacgtggat 960 

cgagagcaac ttgccaaggg ttatgagtac gggcggacat tggtccctat tagcgagacg 1020 

gatgagaata tcaccaccct agagacattt gcggctatcg agcttcttgg gtttatacag 1080 

agcgatcggt atgatcgata catgcacatg tcgacgacaa acatcatcat cgcgcagcgc 1140 

gcgaatgaca aggcagcact cgctctttcc tctttcatac atgcgctgtt cgagctggaa 1200 

tcgtacgctg tcgcccgtat ggtgctaaag gagaacaaac cccctgtcat agtcgtgctt 1260 

gcgccatcaa tcgaacccga ctacgagtgt ctcctcgaag cgcagttgcc attcgcagaa 1320 

gacgtacgaa cgtaccgctt ccctccactc gacagagtca ttacagtgtc tggtaaagtg 1380 

gtgacacagc atcgaaacct acccaacgac gatctgttga atgcgatgga caaatacgtg 1440 

aaaagcatgg agcttaccga tatggacgag aacggtgacc cgacggaatc tctcccaata 1500 

gacgactctt tctctccagt cctgcaccgg atcgactccg caatccgtca ccgtgccatt 1560 

catcccaacg accctatccc gcccccagcc tcagtcctaa cgaagttctc ccaccctccg 1620 

gatgacctcg tcgagaagtc caagaaatac ctagacaagc tagtagcagt gtcggacgtc 1680 

aagaaagtcc caccaaaaac caaaggcacc aaacggaccc gcgaaaccga gaagccacta 1740 

tccggtctcg acgtcgatgc ccttctccac caagagaagc gcacgaagat ctcacccaac 1800 

aacgcaattc ccgagtttaa gcagacgctc tcgcaggcag agaacatcga gatcatcaag 1860 

gatgcagtga agcagatgag cactatcatt gaagaccaaa tcaggcatag tcttggcgat 1920 

gttaattatc atcgggtcac tgaggggcta ggtgtgatgc gggaggaact gatcgattat 1980 

gaggaacctg ctctgtataa cgatttcttg aagcagctga aggagaagtt gttgaaagag 2040






gagctcggtg gggatcgacg ggagctgtgg tggctgctaa gaaggagtaa gttggggttg 2100
attgaacaga gggagtcgga acactctgag gtgagagaag aggaagcgaa ggcgtttatg 2160 
tctatggctg ctaagtga 2178 

<210> 6 
<211> 725 
<212> PRT 

<213> Aspergillus niger 
<400> 6 

Met Ala Asp Lys Glu Ala Thr Val Tyr Ile Val Asp Cys Gly Lys Ser
1 5 10 15

Met Gly Glu Arg Arg His Gly Arg Glu Val Thr Asp Leu Asp Trp Ala
20 25 30

Met Gln Tyr Val Trp Asp Arg Ile Thr Gly Thr Val Ala Thr Gly Arg
35 40 45

Lys Met Ala Leu Ile Gly Val Leu Gly Leu Arg Thr Asp Val Cys Trp
50 55 60

Ser Gly Leu Met Gln Glu Arg Pro Glu Thr Ala Asn Glu Leu Glu Asp
65 70 75 80

Asp Pro Asp Tyr Ser His Ile Ser Val Leu Ser Gly Ile Lys Gln Phe
85 90 95

Leu Met Pro Asp Ile Arg Gly Leu Ser Asp Arg Ile Lys Pro Ser Lys
100 105 110

Thr Asn Lys Gly Asp Ala Ile Ser Ala Leu Val Leu Ala Ile Gln Met
115 120 125

Ile Ile Thr Gln Cys Lys Lys Leu Lys Tyr Lys Arg Arg Ile Val Leu
130 135 140

Val Thr Asn Gly Gln Gly Pro Met Asn Pro Asp Asn Leu Ser Glu Ile
145 150 155 160

Thr Lys Lys Ile Lys Glu Asp Asn Ile Glu Leu Ile Ile Leu Gly Pro
165 170 175

Asp Phe Asp Asp Pro Glu Tyr Gly Val Lys Glu Glu Asp Lys Asp Pro
180 185 190

Arg Lys Ala Glu Asn Glu Thr Leu Leu Arg Ser Leu Ala Glu Asp Cys
195 200 205

Glu Gly Ala Tyr Gly Thr Leu Glu Gln Ala Val Ala Glu Leu Glu Thr
210 215 220

Pro Arg Val Lys Thr Thr Arg Ile Thr Ala Ser Phe Lys Gly His Leu
225 230 235 240

Gln Leu Gly Asn Pro Ala Glu Tyr Asp Thr Ala Val Arg Ile Pro Val
245 250 255

Glu Arg Tyr Tyr Arg Thr Tyr Val Ala Lys Ala Pro Ser Ala Ser Gln
260 265 270

Phe Thr Val Arg Asn Glu Glu Glu Met Gly Met Ala Ala Ala Ala Ala
275 280 285

Gly Ser Gln Glu Gly Ser Ser Leu Val Gly Val Arg Asn Asn Arg Ser
290 295 300

Tyr Gln Ile Asp Asp Gly Thr Thr Glu Glu Gly Val Arg Asp Val Asp
305 310 315 320

Arg Glu Gln Leu Ala Lys Gly Tyr Glu Tyr Gly Arg Thr Leu Val Pro
325 330 335

Ile Ser Glu Thr Asp Glu Asn Ile Thr Thr Leu Glu Thr Phe Ala Ala
340 345 350

Ile Glu Leu Leu Gly Phe Ile Gln Ser Asp Arg Tyr Asp Arg Tyr Met
355 360 365

His Met Ser Thr Thr Asn Ile Ile Ile Ala Gln Arg Ala Asn Asp Lys
370 375 380

Ala Ala Leu Ala Leu Ser Ser Phe Ile His Ala Leu Phe Glu Leu Glu
385 390 395 400






Ser Tyr Ala Val Ala Arg Met Val Leu Lys Glu Asn Lys Pro Pro Val
405 410 415

Ile Val Val Leu Ala Pro Ser Ile Glu Pro Asp Tyr Glu Cys Leu Leu
420 425 430

Glu Ala Gln Leu Pro Phe Ala Glu Asp Val Arg Thr Tyr Arg Phe Pro
435 440 445

Pro Leu Asp Arg Val Ile Thr Val Ser Gly Lys Val Val Thr Gln His
450 455 460

Arg Asn Leu Pro Asn Asp Asp Leu Leu Asn Ala Met Asp Lys Tyr Val
465 470 475 480

Lys Ser Met Glu Leu Thr Asp Met Asp Glu Asn Gly Asp Pro Thr Glu
485 490 495

Ser Leu Pro Ile Asp Asp Ser Phe Ser Pro Val Leu His Arg Ile Asp
500 505 510

Ser Ala Ile Arg His Arg Ala Ile His Pro Asn Asp Pro Ile Pro Pro
515 520 525

Pro Ala Ser Val Leu Thr Lys Phe Ser His Pro Pro Asp Asp Leu Val
530 535 540

Glu Lys Ser Lys Lys Tyr Leu Asp Lys Leu Val Ala Val Ser Asp Val
545 550 555 560

Lys Lys Val Pro Pro Lys Thr Lys Gly Thr Lys Arg Thr Arg Glu Thr
565 570 575

Glu Lys Pro Leu Ser Gly Leu Asp Val Asp Ala Leu Leu His Gln Glu
580 585 590

Lys Arg Thr Lys Ile Ser Pro Asn Asn Ala Ile Pro Glu Phe Lys Gln
595 600 605

Thr Leu Ser Gln Ala Glu Asn Ile Glu Ile Ile Lys Asp Ala Val Lys
610 615 620

Gln Met Ser Thr Ile Ile Glu Asp Gln Ile Arg His Ser Leu Gly Asp
625 630 635 640

Val Asn Tyr His Arg Val Thr Glu Gly Leu Gly Val Met Arg Glu Glu
645 650 655

Leu Ile Asp Tyr Glu Glu Pro Ala Leu Tyr Asn Asp Phe Leu Lys Gln
660 665 670

Leu Lys Glu Lys Leu Leu Lys Glu Glu Leu Gly Gly Asp Arg Arg Glu
675 680 685

Leu Trp Trp Leu Leu Arg Arg Ser Lys Leu Gly Leu Ile Glu Gln Arg
690 695 700

Glu Ser Glu His Ser Glu Val Arg Glu Glu Glu Ala Lys Ala Phe Met
705 710 715 720

Ser Met Ala Ala Lys
725



<210> 7 
<211> 4501 
<212> DNA 

<213> Aspergillus niger 


<400> 7 

cagctcattc agagagctac ccgtagtaga acaggaatac tgggggtatt gtgagaacgc 60 
gaccgcacga ccgcccttcc cattgccaaa gccatcttcc agcaattgtg tgtacatttg 120 
ttccgtcagc gggttggcgt aacggaaggc aacgtacggc ttgtgaggcg cagtctccgg 180 
gttgatcttg tccagcagct tgcacatttc cttgcattgg tattccgacc attttcttat 240
gggtgagcct ccgccgatgt ccgcatactg cttttgaatc ttgggtgtgc gtcgtttcga 300 
aataagaggc ccgaggtaat gctggaactt gccaagagga atcaaatcgc cgtcggcctt 360 
gaatagaagt agaatgttag aaacgtagca accagaatga cagcttgcca tagtcggaga 420
cgtacaaaga gccggctgag gaaatcctct acttcgtctg tcgtcgaggg ccctcccatg 480
ttcaggaaga ccatggctgt agggccctta gagcctgttg catcctgggt aaccggaggc 540
actgttgttg ccagcccaca tctttgttct tgcttgtatc cgaacagggt gcgagaagcc 600
ggtcgcagca attgccgggg cagggtaaac gggcggcgga gagccatgac aggtaattgt 660
actgaattcg gttgacctag tcaatggagg taataagaaa agaccgttcg tatcgcgcaa 720






gcagatgaac tattcacgcc gcattaaata ttcaaaagat ggacgagtgg caagaacagg 780

tagtgggtgt atacaacagc gcaaggcctt ctggaagctg aaaagtccag aacggcttga 840

tgacggagca ccgagaccac gaccaactcc gactcccgac agccaatgac cggccagcta 900

gcgtcatcaa ttaccgggcg gacatcacat gatgttcgtg tctccccgcg tctttctgcc 960

caccggtttg atcgcgtccc tcgcgaccgg atccagtgac gatatagata gatctatctc 1020

cggctgcagg cagcagaggc caaacaggca gacacaacag ccccacttgt tcctggttac 1080

gattcaagtt gtcttaacct ttatacttcc ctctttcaat ttcgataata tcttgaatgc 1140

tttaaacgat tccacaacat tctactatgg cggacggcaa cccacatcgg gaagatgagg 1200

cggccgagga agaagaggag attgatgaga ctgtacgcaa atttacccat gaacttggac 1260

tggaactctg gaactgacaa taagatcaga gctacaaacc agtcaaagat gcggtcctct 1320

tcgcaatcga tgtcagcgat tccatgttga cgcctcgccc ctcagcagat cctaagaaac 1380

acacccaaga atcacccacc acggcagcgc tcaaatgcgc ctatcacttc atgcaacaac 1440

aaatcatat c aaatccacaa aacataataa crtattt tact attcaaaacc caaacatcca 1500

agttctttga agaagatgaa gacagtcggg gagacctgtc ctaccccaac tgctacctct 1560

tcactgatct ggatgttcct tcggctcatg aggtcaaagg acttcgagca ctggtagatg 1620

atgaaggaga ctcaagggag gttctatctc cagcgaaaga gcaggtctct atggcaaacg 1680

tcctattttg cgccaaccag atattcacat ccagagcgcc aaatttcctc tcccggcgtt 1740

tgttcatcat aaccgacaat gacaaccccc atggtgatga taaaaccctg cggtcagcgg 1800

cgactgtacg tgctaaggat ctttacgatc ttggtgtcac aattgagctg tttccgatct 1860

cacgccctga gcatgagttc aagaacagca agttctatga cgtaagctat catactctat 1920

agcaaagtgg caggggtcga tactcactac agatacaaag gatattatct acaagtcatt 1980

gcccagcgat ccagaggcgc ctgcatatct acaatctgat tcaaaagcgg cgactgcgac 2040

cggggacggg atttcactcc tcaacacgct tctgtccagt attaattcga gaacggttcc 2100

ocatcocact catt 1 1 1 coa acatacctt t aaaacttqac ccagacttca gaatttcggt 2160

atcgggctat atactcttac gaaggcaagc gcccgctaga aactccttca tctggctgaa 2220

cggcgagaag cctgtggtcg cgaaaggagt gacttcccac tccgcagatg atactggccg 2280

gactgtcgag aaatgggaga tcagaaaggc atataagttc ggtggcgacc aagtaacctt 2340

ttcgcctgat gagcagaagg cgcttaggga tttcggtgag ccagtaatcc gggttattgg 2400

gttcaagcct atcactgcgc ttccattctg ggcaaacgtc aagcacccat attttatcta 2460



tccatccgag gaagactatg taggctcctc gcgagtattt tccgcattgc atcagactct 2520 

tttgcgttcc aagaagatgg cactcgtctg gttcattgca cgcaagggtg ctggccccgt 2580 

tctcgccgct atgatcgcag gcgaagaaaa gcttgatgag aatggcgtac aaaaataccc 2640

tcctggcatg tggattcttc ccctcccctt cgcagacgat atccggcaga accccgaaac 2700 

aacgttgaat gtcgccccgg agtcattgat tgatcagatg cgcgtgatcg tccagcaact 2760 

gcagctgccg aagggagtgt acgagcctct caaatacccc aatccatgta agtcacttct 2820 

gtcttgcatt gctcgtatac gatgaacgag aagctgacag cccgtgatca gcccttcaat 2880 

ggcattaccg catcctacaa gctctcgcat tagacgaaga tctccccgaa aaaccagaag 2940 

acaaaaccat tccgaaatac cgccaaatcg acaaggtaaa tccaccacac ccaacacgag 3000 

aaataaccct ccaggcgtcc aacttactga caattgcacc acagcgcgcc ggtgactacg 3060 

tattatcctg ggccgacgaa ctcgaaaagc aatacgccaa aacctcagca gcggcccctc 3120 

gcccaaccag caccctcgtg aaacgaggat caaaagaccg agcaagcgaa accgaggact 3180 

ccaagccatc gaaaaagatc aaggttgagg aagactctgg aagcctagag gaggaagtcc 3240

gcaggcatca caagaaggga acgctatcca aggtaagcca ccacaggctt tctacacgtc 3300 

ctcgtgatgg caaatatgac atcgtattaa ccggcggttt tctagcttac ggtcgctatc 3360 

ctcaaggact tcttgacttc caatggacgc tcaaatgccg gtaagaaggc ggatcttatt 3420 

gagcgggtag aggagttctt ggagcagtga catggcggga ttgttggatt cgctagtgcg 3480 

cttctgttgg tggatgtcgt tatgtggtgt cttatctcgg gttaggcgtt cgtgacctga 3540

ggacatgagc ttgtaattaa tgatgggttg gatgtcgcgg tattcgttct tcagcgaaac 3600 

gtaatggaca cgtattttag gcgatgtaca gttataaaaa atcgaattcg ctgggctagc 3660 

cggacatgtc aaaacgaaga gcattaggag agacatatca ggtccaagtg ctatctttca 3720 

aaccagtcgc ttaagatcac cgaggcattt atctccagaa aattcaccgg ttcagcaggt 3780 

gcgcgtatcc cgaattcaaa ttaatattgg aacgatcgta aataaccacc cagattcgcc 3840 

gtaaacgata gtagtcaggc tttgccgccg acagaagggg acgagtatgt caactgagtc 3900 

aacttgaacc gagcagtccc tgtaaacaac gccacgctgt ttgtaatatc cctttagaaa 3960 

cgtattgtcg ctggcaatta tccacaaaaa atgagtctaa acgggcgaaa aaagtcaccc 4020

gaatgggaga atatgtggaa agaagaaaga aagagagacc aaagcaagag agcgccgaaa 4080
ggaagctatc gtaatatata caagtagaag ccgtgggtat ttttataaaa agcagaaacg 4140 






ttaacggtat gcgtacaatg atcaacattg tccataaact tgacagtagc agacttcttc 4200

gtcgggacag ctgagagtag caaagtgtta gtatttagga cgcattcagc aggtaggggg 4260 

ggaggggtgc aaaggcaaca tactatattg attctttgcc gaatatgaca tgccagagaa 4320 

attccatgac acggccacta ctggcgtcat ccttgtcggt atcgattatc cactggcgga 4380 

tcttgatgta gtcctctcgt ggtcgtpggt ggacctgctc ccgggacacg gcgaattgcg 4440

cacagcacgc cgcgccaatc tgtttcggca tttgcaggaa cttctggtat ttagcttcgt 4500

c 4501 

<210> 8 
<211> 4702 
<212> DNA 

<213> Aspergillus niger
<400> 8 

cactcaggat tcttatatct tatggggccc aagtatctct gcagtcaggc gaccaaggaa 60 

cagcgttgca cgctgctgtg gtgggtggct ttcacggctc agtcaatttg cttctgacag 120 

aaaatgcgga ggttgatgca tcatgtacac tcattgggac cccgttggcg gcagtcatgc 180 

ctcgtaaatg gaagtcctgc tgtggtcgtt accatcgaag ctgtgctgag caattaattg 240

cttggggtgc agatattgat cgcattgatg aacgtctggg gactccaatg gacattgcat 300 

acaaggcagg aaacaatgag ggtgtggagc tgttacttga gaatggagca ctggatccca 360 

aatccacagc gtatccacta aattcggaca attgaccggg cacgaagtgc ttttctgttt 420 



gagatatata tggagcactg aagaaaataa tcagagactt gccgtacttg aaaacttgga 480 

gaaatgatcg gatcggtaaa tgtccaattt gccctgggtg tctgggctcg caagacccct 540

ttaaaataat atagacattc acgcactact cgcagcaaat cttaacaatt tgggcttgtc 600 

taagctctgg gagatcacta atttattata gaaccttcaa atgtcgatta gtatgtgaga 660 

gttatcttgt caattcagcc tgttagtaca ataaaaccca ctcatagcgg ctccgtcata 720 

tataaaattg tgcactacac tcacttgcga tatatgatgc gcagacaccc atgttagtat 780 

ctgcaatgtc acttcaattt cgccaacaaa ggaccctcca taaagtagct actctgcaat 840

ttaaatcact agacttgtat cacaaatcat gtaaataaag caatacggag tataagctgc 900 

ccactgcatg cccctcttag taagcaccca ctgcatgatg tcatgtgctt tcgcgtcccg 960 

cccgctccca atcgggaaat atcacgcgtc tgcctactca gagtgcatct ttctgccttg 1020 

agctcgtccc ttttatgtcg agccagctgc ggcatcgaat ggatctgatt ccatcgataa 1080 



tctcagtcat tcatactgaa aatggccgat aaagaggcaa ctgtctacat cgtggactgc 1140 

ggcaagtcca tgggggagcg gcgtcatggt cgcgaagtga cggatctcga ctgggcgatg 1200 

caatatgttt gggatcgtat tacagggacg gtgagatcct tattcttgag aatcatatca 1260

tacatgaaag cttatgtttt ggataggtgg ccactggacg aaaaatggct ttgatcggtg 1320 

ttcttgggct caggacagat ggtgagtgac tagcctcccg ggtacagttg gtagttgtag 1380 

tttgctggtc ggggctaatg caggaacgtc cagaaaccgc taatgagttg gaggatgatc 1440 

ctgattattc gcatatctcg gttttgtctg ggattaaaca gtatgattca tttttgtctg 1500 

ctgatcctct ggttattcgc tgatgaacta taggtttctt atgccggata tccggggttt 1560 

gagcgaccga ataaagccta gcaagactaa taagggagat ggtgagttac tcttcttgta 1620 

tggaattgga gtgattgggg ctgagccgat gaatatagct atctctgcac ttgtgctcgc 1680 

gattcagatg attatcactc agtgcaagaa actgaagtac aagcgcagga ttgtcctggt 1740

tactaatggg cagggcccga tgaacccgga taatcttagt gaaataacga agaagattaa 1800

ggaggataac attgaactta ttattctgtt agtgtcaatt gatacactga gagaaccggg 1860

gtactaacat gctgcagggg accagacttt gatgatcctg aatatggggt gaaagaggaa 1920

gataaagatc cgcgaaaggt atttaacttc gttccatatg ctctagacta ataataacaa 1980 

tggctacagg ccgaaaatga aacactcctg cgtagtcttg ccgaagactg cgaaggagcc 2040 

tatggaaccc tagaacaagc tgttgcggag ctggaaactc ctcgtgtgaa aaccacaagg 2100 

ataacagcaa gcttcaaggg ccatttgcaa ctaggaaacc ccgcagaata tgatactgca 2160 

gttcggatcc ctgtggagcg ctactacagg acatacgttg caaaagctcc gtcggctagt 2220 

cagttcacag tacgtaacga agaggagatg ggaatggccg cggccgcagc cggctcgcag 2280 

gaaggtagtt cccttgtggg tgttcgaaac aacaggtcct accaaattga cgatgggact 2340

actgaagaag gggtgaggga cgtggatcga gagcaacttg ccaagggtta tgagtacggg 2400

cggacattgg tccctattag cgagacggat gagaatatca ccaccctaga gacatttgcg 2460

gctatcgagc ttcttgggtt tatacagagc gatcgggtga gttctaccct ccaataactg 2520

ttattatgct gctaagtggg ttttgccatt agtatgatcg atacatgcac atgtcgacga 2580

caaacatcat catcgcgcag cgcgcgaatg acaaggcagc actcgctctt tcctctttca 2640

tacatgcgct gttcgagctg gaatcgtacg ctgtcgcccg tatggtgcta aaggagaaca 2700

aaccccctgt catagtcgtg cttgcgccat caatcgaacc cgactacgag tgtctcctcg 2760

aagcgcagtt gccattcgca gaagacgtac gaacgtaccg cttccctcca ctcgacagag 2820 






tcattacagt gtctggtaaa gtggtgacac agcatcgaaa cctacccaac gacgatctgt 2880

tgaatgcgat ggacaaatac gtgaaaagca tggagcttac cgatatggac gagaacgggt 2940

gagaagaatt ggaagtgatc tcaacttcac tgctgacttt gtacaaagtg acccgacgga 3000


atctctccca atagacgact ctttctctcc agtcctgcac cggatcgact ccgcaatccg 3060

tcaccgtgcc attcatccca acgaccctat cccgccccca gcctcagtcc taacgaagtt 3120

ctcccaccct ccggatgacc tcgtcgagaa gtccaagaaa tacctagaca agctagtagc 3180

agtgtcggac gtcaagaaag gtcagtccat ctcggccttg agcctcttag gcccccatca 3240

tactcacagt gatgaatcta gtcccaccaa aaaccaaagg caccaaacgg acccgcgaaa 3300


ccgagaagcc actatccggt ctcgacgtcg atgcccttct ccaccaagag aagcgcacga 3360

agatctcacc caacaacgca attcccgagt ttaagcagac gctctcgcag gcagagaaca 3420

tcgagatcat caaggatgca gtgaagcaga tgagcactat cattgaagac caaatcaggc 3480




atagtcttgg cgatgttaat tatcatcggg tcactgaggg gctaggtgtg atgcgggagg 3540


aactgatcga ttatgaggaa cctgctctgt ataacgattt cttgaagcag ctgaaggaga 3600

agttgttgaa agaggagctc ggtggggatc gacgggagct gtggtggctg ctaagaagga 3660

gtaagttggg gttgattgaa cagagggagt cggaacactc tgaggtgaga gaagaggaag 3720

cgaaggcgtt tatgtctatg gctgctaagt gagcagaccg ttattgatcc ctattagtcc 3780

ccgattaagg actgggcaac agttcgataa tgacaaatga acaagctcca atgctgcatg 3840

actgtgctcg ctagagtaca acaitcacga taaccctgcg ctaagtaaca aggcttatcc 3900

catgccaaat gtaacacaca taacatataa taccaaattc gatgaacagt acacgggata 3960

t-nnatcatga ccatgagtag aaatagacat cgcaagcaac cattatatcc acacactcaa 4020

gaaattctcc caatcctctt cttcccaata tcaatcttcc cacccaacct atattcaagt 4080

cagcacaact ttaccatcaa aaagtaagaa caagatggga aaaaagaaca tacctcgtag 4140

catcctcatc cgccaaattc aacaactcat tctgcacctg caactcattg ttaattgcaa 4200

tccccaactc cttctgccga ttgacaatcc tcatcaactc atccaccgcg acatcctgat 4260

cctccatcat ctgcttctgc aactgcacca ccccactatt atccaactcg cgcgtcctct 4320

ccgtctccct ccccaacact ctccccgacg accgaatcgc cttcttgccc ccctgcgtcc 4380

ccatcaacgc ctccttatcc tgaatcgacg ccaccgcact atcaatccga ctcttcgccg 4440

ccatcgcatt caacagatcc tccagtccat ctttctcctt cctcgcgttg atgagtagat 4500



ccttccgtcg tcgcatctct ccttccccga gcgtgttgtt cccactaaca ctccatgccg 4560

tggcggtcgt agcrtgtattt ctgcgcccga gtttacttcc tggtcttgac cctgcattat 4620

tttcctcccc acctccagaa gaagagttac tcaaattcct caacccactc tccaaactcg 4680

caatcaatcc cccggcccta ac 4702